(57) Abstract



A modified polypeptide corresponding to an envelope glycoprotein of a primate
lentivirus is described. The polypeptide has been modified from the wild-type
structure so that cysteine residues are introduced to create disulfide bonds,
a cavity is filled with hydrophobic amino acids, a Pro residue is introduced
at a defined turn structure of the protein, or the hydrophobicity is increased
across the interface between different domains, while retaining the overall
3-dimensional structure of a discontinuous conserved epitope of the wild-type
protein. Preferably, the polypeptide has more than one of those
characteristics. Preferably, the primate lentivirus is HIV, and the protein is
HIV-1 gp120.

STABILIZED PRIMATE LENTIVIRUS ENVELOPE GLYCOPROTEINS

FIELD OF THE INVENTION 

The present invention is directed to envelope polypeptides having a
structure that approximates conformational discontinuous epitopes of a
primate lentivirus envelope protein but that, as a result of modifications of
that structure, have enhanced stability, raise a greater range of antibodies
to conserved epitopes, and/or have enhanced immunogenicity for broadly
neutralizing epitopes.

BACKGROUND OF THE INVENTION



Human immunodeficiency virus type 1 (HIV-1) is an etiologic agent of
acquired immunodeficiency syndrome (AIDS), which is characterized by the
depletion of CD4-positive lymphocytes (See, Barre-Sinoussi, F., et al.,
"Isolation of a T-lymphotropic Retrovirus From a Patient at Risk for Acquired
Immunodeficiency Syndrome (AIDS)," Science 220:868-871 (1983); Gallo,
RC, et al., "Frequent Detection and Isolation of Cytopathic Retroviruses
(HTLV-III) From Patients with AIDS and at Risk for AIDS," Science
224:500-503 (1984)). Infection of humans by HIV-1 typically involves an
initial period of acute, high-level viremia, followed by a chronic, low-level
viremia (See, Coombs, RW, et al., "Plasma Viremia in Human Immunodeficiency
Virus Infection," N. Engl. J. Med. 321:1626-1631 (1989); Clark, SJ, et al.,
"High Titers of Cytopathic Virus in Plasma from Patients with Symptomatic
Primary HIV-1 Infection," N. Engl. J. Med. 324:950-960 (1991); Daar, ES, et
al., "Transient High Levels of Viremia in Patients with Primary Human
Immunodeficiency Virus Type 1 Infection," N. Engl. J. Med. 324:961-964
(1991); Fauci, AS, et al., "Immunopathogenic Mechanisms of HIV Infection,"
Ann. Intern. Med. 124:654-663 (1996)). It is thought that the antiviral
immune response helps to determine the "set-point" for chronic viremia.
HIV-1 persistence results in progressive CD4-positive lymphocyte decline,
which ultimately compromises the immune response, including that
directed against HIV-1. The resulting resurgence of high-level viremia is a
harbinger of poor clinical outcome (See, Ho, DD, et al., "Quantitation of
Human Immunodeficiency Virus Type 1 in the Blood of Infected Persons,"
N. Engl. J. Med. 321:1621-1625 (1989)).

The envelope protein of a lentivirus is the most visible portion of the
virion because it is on the surface of the virus particle. Thus, considerable
attention has focused on the envelope protein as a target for inhibiting viral
entry. Strategies that have been used include using the envelope protein to
generate an immune response, using decoys for the envelope protein, etc. These
approaches have not yet been successful.

It was recently reported that a large-scale clinical trial was going to be
attempted with an HIV envelope protein as an immunogen. While the initial
trials with the protein have not been reported to be promising in terms of
showing any significant protective immunity, they have also not indicated
any significant harm caused by the vaccine candidate. The fact that a
clinical trial would be attempted with preliminary results of this type shows
the importance placed upon the use of the envelope protein and
underscores the need for improvements in enhancing the immunogenicity of
the envelope protein.

The envelope protein is an attractive target because, like that of other
retroviruses, the entry of HIV-1 into target cells is mediated by the viral
envelope glycoproteins, gp120 and gp41, which are derived from a gp160
precursor (See, Allan, JS, et al., "Major Glycoprotein Antigens That Induce
Antibodies in AIDS Patients are Encoded by HTLV-III," Science
228:1091-1093 (1985); Robey, WG., et al., "Characterization of Envelope and
Core Structural Gene Products of HTLV-III with Sera from AIDS Patients,"
Science 228:593-595 (1985)). The gp160 glycoprotein is created by the
addition of N-linked, high-mannose sugar chains to the approximately 845-870
amino acid primary translation product of the env gene in the rough
endoplasmic reticulum. Trimerization of gp160 in the endoplasmic reticulum is
mediated by the formation of a coiled coil within the gp41 ectodomain. (See,
Earl, PL., et al., "Oligomeric Structure of the Human Immunodeficiency Virus
Type 1 Envelope Glycoprotein," Proc. Natl. Acad. Sci. USA 87:648-652 (1990);
Pinter, A., et al., "Oligomeric Structure of gp41, the Transmembrane Protein
of Human Immunodeficiency Virus Type 1," J. Virol. 63:2674-2679; Lu, M., et
al., "A Trimeric Structural Domain of the HIV-1 Transmembrane
Glycoprotein," Nature Structural Biol. 2:1075-1082 (1995); Chan, DC, et al.,
"Core Structure of gp41 from the HIV Envelope Glycoprotein," Cell
89:263-273 (1997); and Weissenhorn, W., et al., "Atomic Structure of the
Ectodomain from HIV-1 gp41," Nature 387:426-430 (1997)). The gp160
trimers are transported to the Golgi apparatus, where cleavage by a cellular
protease generates the mature gp120 and gp41 glycoproteins, which remain
associated through non-covalent interactions (Earl, PL, et al., "Folding,
Interaction with GRP78-BiP, Assembly and Transport of the Human
Immunodeficiency Virus Type 1 Envelope Protein," J. Virol. 65:2047-2055
(1991); and Kowalski, M., et al., "Functional Regions of the Envelope
Glycoprotein of Human Immunodeficiency Virus Type 1," Science
237:1351-1355 (1987)). In mammalian host cells, addition of complex sugars to
selected, probably surface-exposed, carbohydrate side chains of the
envelope glycoproteins occurs in the Golgi apparatus. (See, Leonard, CK, et
al., "Assignment of Intrachain Disulfide Bonds and Characterization of
Potential Glycosylation Sites of the Type 1 Recombinant Human
Immunodeficiency Virus Envelope Glycoprotein (gp120) Expressed in
Chinese Hamster Ovary Cells," J. Biol. Chem. 265:10373-10382 (1990)).

Most of the surface-exposed elements of the oligomeric envelope
glycoprotein complex are contained on the gp120 exterior envelope
glycoprotein. (See, Moore, J., et al., "Probing the Structure of the Human
Immunodeficiency Virus Surface Glycoprotein gp120 with a Panel of
Monoclonal Antibodies," J. Virol. 68:469-484 (1994)). When the gp120
glycoproteins derived from different primate immunodeficiency viruses are
compared, five conserved regions (C1 to C5) and five variable regions (V1 to
V5) can be identified. (See, Starcich, BR, et al., "Identification and
Characterization of Conserved and Variable Regions of the Envelope Gene
HTLV-III/LAV, the Retrovirus of AIDS," Cell 45:637-648 (1986); Myers, G., et
al., "Human Retroviruses and AIDS: A Compilation and Analysis of Nucleic
Acid and Amino Acid Sequences," Los Alamos National Laboratory (1994)).
Intramolecular disulfide bonds in the gp120 glycoprotein result in the
incorporation of the first four variable regions into large, loop-like
structures. Antibody binding studies and deletion mutagenesis have
indicated that the major variable loops are well-exposed on the surface of
the gp120 glycoprotein. (See, Wyatt, R., et al., "Functional and Immunologic
Characterization of Human Immunodeficiency Virus Type 1 Envelope
Glycoproteins Containing Deletions of the Major Variable Regions," J. Virol.
67:4557-4565 (1993); Pollard, S., et al., "Truncated Variants of gp120 Bind
CD4 with High Affinity and Suggest a Minimum CD4 Binding Region,"
EMBO J. 11:585-591 (1992)).

The mature envelope glycoprotein complex is incorporated into HIV-1
virions, where it mediates virus entry into the host cell. The gp120 exterior
glycoprotein binds the CD4 glycoprotein, which serves as the primary
receptor. (See, Klatzmann, D., et al., "T-lymphocyte T4 Molecule Behaves as
the Receptor for Human Retrovirus LAV," Nature (London) 312:767-768
(1984); and Dalgleish, AG., et al., "The CD4 (T4) Antigen is an Essential
Component of the Receptor for the AIDS Retrovirus," Nature 312:763-767
(1984)). The association of gp120 with CD4 is mediated by the interaction of
a discontinuous gp120 structure with the CDR2-like region of the CD4
amino-terminal domain. (See, Brodsky, MH., et al., "Analysis of the Site in
CD4 that Binds to the HIV Envelope Glycoprotein," J. Immunol.
144:3078-3086 (1990); Peterson, A., et al., "Genetic Analysis of Monoclonal
Antibody and HIV Binding Sites on the Human Lymphocyte Antigen CD4," Cell
54:65-72 (1988); Moebius, U., et al., "The Human Immunodeficiency Virus gp120
Binding Site on CD4: Delineation by Quantitative Equilibrium and Kinetic
Binding Studies of Mutants in Conjunction with a High-Resolution CD4
Atomic Structure," J. Exp. Med. 176:507-517 (1992); Arthos, J., et al.,
"Identification of the Residues in Human CD4 Critical for the Binding of
HIV," Cell 57:469 (1989); Ryu, SE., et al., "Crystal Structure of an
HIV-binding Recombinant Fragment of Human CD4," Nature (London)
348:419-425 (1990); and Wang, J., et al., "Atomic Structure of a Fragment of
Human CD4 Containing Two Immunoglobulin-like Domains," Nature (London)
348:411-418 (1990)). Amino acids in the gp120 C3 and C4 regions have
been implicated in CD4 binding. (See, Cordonnier, A., et al., "Single Amino
Acid Changes in HIV Envelope Affect Viral Tropism and Receptor Binding,"
Nature 340:571-574 (1989); Lasky, L., et al., "Delineation of a Region of the
Human Immunodeficiency Virus Type 1 gp120 Glycoprotein Critical for
Interaction with the CD4 Receptor," Cell 50:975-985 (1987); and Olshevsky,
U., et al., "Identification of Individual HIV-1 gp120 Amino Acids Important
for CD4 Receptor Binding," J. Virol. 64:5701-5707 (1990)). The association
of gp120 with CD4 is believed to initiate conformational changes in the
HIV-1 envelope glycoprotein complex, leading to interactions with members of
the chemokine receptor family. (See, Sattentau, Q., et al., "Conformational
Changes Induced in the Human Immunodeficiency Virus Envelope
Glycoprotein by Soluble CD4 Binding," J. Exp. Med. 174:407-415 (1991);
Thali, M., et al., "Characterization of Conserved Human Immunodeficiency
Virus Type 1 (HIV-1) gp120 Neutralization Epitopes Exposed Upon
gp120-CD4 Binding," J. Virol. 67:3978-3988 (1993); Sattentau, Q., et al.,
"Conformational Changes Induced in the Envelope Glycoproteins of Human
and Simian Immunodeficiency Virus by Soluble Receptor Binding," J. Virol.
67:7388-7393 (1993); Trkola, A., et al., "CD4-dependent, Antibody-sensitive
Interactions Between HIV-1 and its Co-receptor CCR-5," Nature
384:184-187 (1996); and Wu, L., et al., "CD4-induced Interaction of Primary
HIV-1 gp120 Glycoproteins with the Chemokine Receptor CCR5," Nature
384:179-183 (1996)).

Chemokine receptors are G protein-coupled, seven-membrane-spanning
proteins involved in leukocyte chemotaxis. (See, Baggiolini, M., et
al., "Interleukin-8 and Related Chemotactic Cytokines-CXC and CC
Chemokines," Adv. Immunol. 55:97-179 (1994); Gerard, N., et al., "The
Pro-Inflammatory Seven-Transmembrane-Segment Receptors of the Leukocyte,"
Curr. Opin. Immunol. 6:140-145 (1994); and Premack, BA., et al., "Chemokine
Receptors: Gateways to Inflammation and Infection," Nature Medicine
11:1174-1178 (1996)). Most laboratory-adapted HIV-1 viruses utilize a CXC
chemokine receptor called CXCR4 (also called LESTR, HUMSTSR or fusin),
while most macrophage-tropic primary HIV-1 viruses use the CC chemokine
receptor CCR5 (see, Feng, Y., et al., Science 272:872-877 (1996); Choe, H.,
et al., Cell 85:1135-1148 (1996); Deng, HK., et al., Nature 381:661-666
(1996); Dragic, T., et al., Nature 381:667-673 (1996); Doranz, BJ., et al.,
Cell 85:1149-1158 (1996); and Alkhatib, G., et al., Science 272:1955-1958
(1996)), and to some extent CCR3 or CCR2. Primary dual-tropic HIV-1 isolates
use CCR5 as well as CXCR4. (See, Zhang, L., et al., Nature 383:768 (1996)
and Connor, R., et al., J. Exp. Med. 185:621-628 (1997)). The
macrophage-tropic primary viruses are those most often transmitted from
infected to uninfected individuals, and they predominate during the long,
asymptomatic phase of infection. (See, Cheng-Mayer, C., et al., Science
240:80-82; Zhu, T., et al., Science 261:1179-1181 (1993); Fenyo, E., J. Virol.
62:4414-4419 (1988); Schuitemaker, H., et al., J. Virol. 66:1354-1360 (1991);
and Connor, RI., et al., J. Virol. 67:1772-1778 (1993)). The importance of
CCR5 for HIV-1 transmission is underscored by the observation that humans
with homozygous defects in CCR5 are relatively resistant to HIV-1 infection.
(See, Liu, R., et al., Cell 86:367-378 (1996); Samson, M., et al., Nature
382:722-725 (1996); and Dean, M., et al., Science 273:1856-1862 (1996)).
CCR5 is used as a coreceptor by almost all primary HIV-1 isolates regardless
of geographic clade, and is used by the related human and primate
immunodeficiency viruses, HIV-2 and simian immunodeficiency virus, SIV.
(See, Marcon, L., et al., J. Virol. 71:2522-2527 (1997); Chen, Z., et al., J.
Virol. 71:2705-2714 (1997); and Cocchi, F., et al., Science 270:1811-1815
(1995)). This suggests that at least part of the viral binding site for CCR5 is
well-conserved among these immunodeficiency viruses. While these gp120
structures are under investigation and have yet to be completely defined,
mutagenic studies have suggested that elements of the V3 loop may
constitute part of the chemokine receptor binding site. Genetic studies of
viruses with chimeric HIV-1 envelope glycoproteins containing different V3
loops demonstrated that the gp120 V3 region is a major determinant of
which chemokine receptor, CCR5 or CXCR4, can be used as an entry
cofactor. (See, Cocchi, F., et al., Nature Med. 2:1244-1247 (1996); and
Speck, R., et al., J. Virol. (in press)). Thus, even in the relatively variable
background of the V3 domain, there may exist conserved structural features
that collaborate with other conserved gp120 structures to create a
high-affinity binding site for CCR5.

It is likely that the interaction of the gp120-CD4 complex with the
appropriate chemokine receptor promotes additional conformational
changes in the envelope glycoprotein complex. By analogy with the
influenza hemagglutinin, it has been suggested that the HIV-1 gp41
ectodomain undergoes major conformational changes during virus entry.
(See, Carr, CM., et al., Cell 73:823-832 (1993); Chen, CH., et al., J. Virol.
69:3771-3777 (1995); Bullough, P., et al., Nature 371:37-43 (1994); and
Weissenhorn, W., et al., EMBO J. 15:1507-1514 (1996)). The proposed
result of these changes is the insertion of the hydrophobic gp41 amino
terminus (the "fusion peptide") into the membrane of the target cell.
Mutagenic analysis and the recently determined crystal structures of HIV-1
gp41 ectodomain fragments are consistent with this model (see, Freed, E., et
al., Proc. Natl. Acad. Sci. USA 87:4650-4654 (1990)).

The exposed nature of the HIV-1 envelope glycoproteins on the
surface of virions or infected cells renders them prime targets for the
antiviral immune response. In fact, the only viral proteins accessible to
neutralizing antibodies are the envelope glycoproteins. Neutralizing
antibodies appear to be an important component of a protective immune
response in chimpanzees challenged with HIV-1 (see, Berman, PW., et al.,
Nature 345:622-625 (1990); Girard, et al., Proc. Natl. Acad. Sci. USA
88:542-546 (1991); Emini, et al., Nature 355:728-730 (1991); and Bruck, et
al., Vaccine 12:1141-1148 (1994)). That neutralizing antibodies generated
during the course of HIV-1 infection do not provide a permanent antiviral
effect may in part be due to the generation of neutralization escape virus
variants (see, Nara, et al., J. Virol. 64:3779-3791 (1990); Gegerfelt, et al.,
Virology 185:162-168 (1991); and Arendrup, et al., J. AIDS 5:303-307
(1992)), and to the general decline in the host immune system associated
with pathogenesis.

HIV-1 neutralizing antibodies are mostly directed against linear or
discontinuous epitopes of the gp120 exterior envelope glycoprotein. Rare
examples of gp41-directed neutralizing antibodies have also been
documented (see, Muster, et al., J. Virol. 67:6642-6647 (1993)). Neutralizing
antibodies that arise early in infected humans and that are readily
generated in animals by immunization are primarily directed against linear
neutralizing determinants in the third variable (V3) loop of the gp120
glycoprotein (see, Matthews, et al., Proc. Natl. Acad. Sci. USA 83:9709-9713
(1986); and Javaherian, et al., Science 250:1590-1593 (1990)). These
antibodies generally exhibit the ability to neutralize only a limited number of
HIV-1 strains, although some subsets of anti-V3 antibodies recognize less
variable elements of the region and therefore exhibit broader neutralizing
activity (see, Ohno, et al., Proc. Natl. Acad. Sci. USA 88:10726-10729 (1991);
Moore, et al., J. Virol. 69:122-133 (1995); and Gorny, et al., J. Virol.
66:7538-7542 (1992)). Envelope glycoprotein variation within the linear V3
epitope and outside of the epitope can allow escape of viruses from
neutralization by these antibodies (see, McKeating, et al., J. Virol.
67:4932-4944 (1993)). The second variable (V2) region of the HIV-1 envelope
glycoprotein has also been shown to be a target for strain-restricted
neutralizing antibodies (see, Fung, et al., J. Virol. 66:848-856 (1992);
Moore, et al., J. Virol. 67:6136-6151 (1993)). Most of the V2 epitopes
consist of continuous but conformation-dependent determinants.

Later in the course of HIV-1 infection of humans, antibodies capable
of neutralizing a wider range of HIV-1 isolates appear (see, Profy, et al., J.
Immunol. 144:4641-4647 (1990); Berkower, et al., J. Exp. Med.
170:1681-1695 (1989); Ho, et al., J. Virol. 489-493 (1991); Kang, et al.,
Proc. Natl. Acad. Sci. USA 88:6171-6175 (1991); Steimer, et al., Science
254:105-108 (1991); and Moore, et al., J. Virol. 67:863-875 (1993)). These
broadly neutralizing antibodies have been difficult to elicit in animals (see,
Rusche, et al., Proc. Natl. Acad. Sci. USA 84:6924-6928 (1987); Klaniecki, et
al., AIDS Res. Hum. Retro. 7:791-798 (1991); and Haigwood, et al., J. Virol.
66:172-182 (1992)), and are not merely the result of additive anti-V3 loop
reactivities against diverse HIV-1 isolates that accumulate during active
infection. A subset of the broadly reactive antibodies, found in most
HIV-1-infected individuals, interferes with the binding of gp120 and CD4. At
least some of these antibodies recognize discontinuous gp120 epitopes (the
so-called CD4BS epitopes) present only on the native glycoprotein. Human
monoclonal antibodies derived from HIV-1-infected individuals have been
identified that recognize the gp120 glycoproteins from a diverse range of
HIV-1 isolates, that block gp120-CD4 binding, and that neutralize virus
infection (see, Posner, et al., J. Immunol. 146:4325-4332 (1991); and Tilley,
et al., Res. Virol. 142:247-259 (1991)). Some of these CD4BS-directed
antibodies efficiently neutralize primary HIV-1 isolates (see, Burton, et al.,
Science 266:1024-1027 (1994)), which are generally more resistant to
neutralization than are viruses passaged in immortalized cell lines (see,
Daar, et al., Proc. Natl. Acad. Sci. USA 87:6574-6578 (1990); Wrin, et al., J.
Virol. 69:39-48 (1995); Sullivan, et al., J. Virol. 69:4413-4422 (1995);
Sawyer, et al., J. Virol. 67:1342-1349 (1994); Moore, et al., J. Virol.
69:101-109 (1995); and D'Souza, et al., J. Infect. Dis. 175:(in press)
(1997)). The discontinuous epitopes recognized by many of the human
monoclonal antibodies directed against the CD4BS epitopes have been
characterized by mutagenic analysis (see, Thali, et al., J. Virol.
65:6188-6193 (1991); Thali, et al., J. Virol. 66:5635-5641 (1992); McKeating,
et al., Virology 190:134-142 (1992)). Amino acid changes in seven
discontinuous gp120 regions, four of which overlap regions defined to be
important for CD4 binding, disrupt recognition by these antibodies and, in
some cases, allow the generation of neutralization escape mutants.

A second group of neutralizing antibodies found in a smaller number
of HIV-1-infected humans is directed against conserved gp120 epitopes that
are exposed better upon CD4 binding (see, Thali, et al., J. Virol.
67:3978-3988 (1993)). These epitopes, referred to as the CD4-induced (CD4i)
epitopes, are extremely sensitive to conformational changes in the gp120
glycoprotein. The integrity of these epitopes is affected by gp120 amino acid
changes in the conserved stem of the V1/V2 stem-loop structure and in the
C4 region. The CD4i epitopes have been shown to be proximal to the V3
loop and to be masked by the V1/V2 variable loops (see, Wyatt, et al., J.
Virol. 69:5723-5733 (1995); and Moore, et al., J. Virol. 70:1863-1872 (1996)).
It has been shown that CD4 binding induces a movement of the V1/V2
loops that exposes the CD4i epitopes. Interestingly, it has been shown that
neutralizing antibodies directed against either the V3 loop or the CD4i
epitopes block the ability of gp120-CD4 complexes to bind CCR5. Thus, it
appears that the major groups of neutralizing antibodies generated in
HIV-1-infected humans block the binding of virus to its cellular receptors,
either CD4 or the chemokine receptors.

The development of an HIV-1 vaccine, as explained above, has been
hampered by the inefficiency with which antibodies directed against the
more conserved gp120 structures are elicited. Most of the antibodies
elicited by the HIV-1 envelope glycoproteins, either in infected humans or
chimps or in animals immunized with envelope glycoprotein preparations,
are not able to neutralize virus. Many of these non-neutralizing antibodies
are directed against gp120 structures that are inaccessible on the native
envelope glycoprotein complex due to interaction with the gp41 ectodomain
(see, Wyatt, et al. (1997)). When neutralizing antibodies are elicited, these
are often directed against variable portions of the HIV-1 envelope
glycoproteins. Most of the neutralizing antibodies elicited by native HIV-1
gp120 or gp160 glycoproteins are directed against the V3 loop (see,
Haigwood, et al., AIDS Res. Hum. Retro. 6:855-869 (1990)). Multiple
immunizations with native gp120 or gp160 glycoproteins are required to
elicit even low titers of neutralizing antibodies with broader strain reactivity.
This same pattern of elicitation of neutralizing antibodies has been observed
in HIV-1-infected humans or chimps, with antibodies directed against the
V3 loop appearing earlier in infection. These results suggest that the
structure of the HIV-1 gp120 envelope glycoprotein has evolved to decrease
the immunogenicity of particular epitopes in which variation is poorly
tolerated by the virus. By the time immune responses to these epitopes are
elicited, immune compromise has occurred, viral burden is high, and virus
variation and the potential for neutralization escape have reached significant
levels. These considerations suggest that use of the native, complete HIV-1
glycoprotein as an immunogen will most efficiently elicit the same types of
immune responses that the virus has evolved to evade most efficiently.
Improved immunogens based upon the envelope protein are necessary.

Previous studies have indicated that the relatively poor surface
accessibility of the more conserved gp120 epitopes related to the CD4 and
chemokine receptor binding sites may in part provide an explanation for the
low apparent immunogenicity of these regions.

One approach to improve the immunogenicity of gp120 polypeptides
has been to remove at least a portion of the "masking" variable loops while
retaining the overall conformation of the polypeptide so that it approximates
that of the native gp120. This can be done by appropriate selection of
amino acid residues to permit the structure to turn. In this manner the
conserved conformational epitopes are more exposed and can be used to
generate antibodies to these conserved epitopes. Additional improvements
in generating such polypeptides would be useful. The V1/V2 and V3
variable loops of the HIV-1 gp120 glycoprotein have been shown to mask the
CD4BS epitopes, and removal of these variable regions results in a 5- to
50-fold increase in exposure of most of the CD4BS epitopes, on both the
monomeric and the multimeric envelope glycoproteins. Removal of the V1 and
V2 variable loops results in an increased exposure of HIV-1 gp120 epitopes
(V3 and CD4i epitopes) located near the binding site for the chemokine
receptors. Thus, both of the receptor-binding regions of the HIV-1 gp120
glycoprotein are partially masked by the large variable loop structures of the
glycoprotein.

It is imperative that means of efficiently eliciting an array of 
antibodies directed against the more conserved gpl20 elements be 
developed. 

SUMMARY OF THE INVENTION 

We have now found polypeptides, modified from a primate lentivirus
envelope glycoprotein such as the HIV-1 envelope glycoproteins, that can
improve the stability and/or enhance the immunogenicity of neutralization
epitopes, particularly those conserved on different primary viruses such as
the CD4BS and/or CD4i epitopes. The modifications include the deletion of
particular variable loops and/or stabilization of functionally relevant
envelope glycoprotein structures through the formation of internal disulfide
bonds. For example, we have found that introducing cysteine residues at one
or more of the following pairs of amino acid residues results in the formation
of disulfide bonds and substantially stabilizes the structure of the protein:





WO 99/24465                                        PCT/US98/24001

        Pro 118*        Ala 433
        Leu 122         Gly 431
        Gly 380         Phe 210
        Ser 256         Phe 376

* The numbering is based upon HXBc2 numbering and can readily be
extrapolated to other viruses and strains.

Preferably, disulfide bonds are introduced at Pro118-Ala433,
Leu122-Gly431, Phe210-Gly380, or Ser256-Phe376.

Alternatively, or in addition, one can fill the cavities discovered in the
interior of HIV-1 gp120 with hydrophobic residues, using substitutions such as
Ser375->Trp, Val155->Trp, Arg273->Trp, Ser481->Phe, and Ser447->Ile. These
cavity-filling substitutions should stabilize a native HIV-1 gp120
conformation.

Alternatively, or in addition, one can introduce prolines at defined
turn structures, such as Ile423->Pro, thus stabilizing these turn structures
in the gp120 "bridging sheet," which appear to be conformationally flexible
(see below).

Alternatively, or in addition, one can increase the hydrophobicity
across the interface between the gp120 domains with substitutions such as
Asn377->Leu, Thr283->Ile, and Asp477->Leu. These substitutions are predicted
to decrease interdomain flexibility.
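
The substitution notation used above (wild-type residue, HXBc2 position, replacement residue) can be handled mechanically. The following is a minimal sketch, not part of the described invention: it parses strings written in the "Ser375->Trp" form and converts them to the common one-letter shorthand. The function names, the grouping keys, and the "->" spelling of the arrow are illustrative assumptions; only the residues and positions are taken from the text.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Matches e.g. "Ser375->Trp" (optional spaces around the number and arrow).
_PATTERN = re.compile(r"^([A-Z][a-z]{2})\s*(\d+)\s*->\s*([A-Z][a-z]{2})$")


def parse_substitution(text):
    """Return (wild_type, position, replacement) for a string like 'Ser375->Trp'."""
    match = _PATTERN.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized substitution: {text!r}")
    wild_type, position, replacement = match.group(1), int(match.group(2)), match.group(3)
    if wild_type not in THREE_TO_ONE or replacement not in THREE_TO_ONE:
        raise ValueError(f"unknown residue name in: {text!r}")
    return wild_type, position, replacement


def short_form(text):
    """Convert 'Ser375->Trp' to the one-letter shorthand 'S375W'."""
    wild_type, position, replacement = parse_substitution(text)
    return f"{THREE_TO_ONE[wild_type]}{position}{THREE_TO_ONE[replacement]}"


# Substitutions named in the text, grouped by purpose (group names are hypothetical).
SUBSTITUTIONS = {
    "cavity_filling": ["Ser375->Trp", "Val155->Trp", "Arg273->Trp",
                       "Ser481->Phe", "Ser447->Ile"],
    "turn_proline": ["Ile423->Pro"],
    "interdomain_hydrophobic": ["Asn377->Leu", "Thr283->Ile", "Asp477->Leu"],
}

if __name__ == "__main__":
    for group, items in SUBSTITUTIONS.items():
        print(group, [short_form(item) for item in items])
```

Positions here are plain HXBc2 numbers; mapping them onto another strain would require a sequence alignment, which this sketch does not attempt.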

These changes can be inserted in a polypeptide that contains all the
variable regions or, more preferably, into a polypeptide wherein at least a
portion of a variable region, preferably the V1/V2 loops, has been deleted
with a linker amino acid residue inserted to retain turns in the structure so
that it approximates the conformation of at least one discontinuous
conformational epitope of the native envelope protein, such as the CD4BS or
CD4i epitopes.

BRIEF DESCRIPTION OF THE FIGURES 

Figs. 1A-1E show the structure of the HIV-1 gp120 region implicated
in CCR5 binding.

Fig. 1A shows a ribbon drawing of the HIV-1 gp120 glycoprotein
complexed with CD4. The perspective is that from the target cell membrane.
The two amino-terminal domains of CD4 are shown in blue. The gp120
inner domain is colored red, the outer domain is colored yellow, and the
"bridging sheet" is orange. The gp120 residues in which changes resulted in
a >90% decrease in CCR5 binding are labeled. The V1/V2 stem and the
base of the V3 loop (strands β12 and β13 and the associated turn) are
indicated.

Fig. 1B shows a molecular surface of the gp120 glycoprotein from the
same perspective as that of Fig. 1A. Colored surfaces are associated with
gp120 residues in which changes resulted in either a >75% decrease
(yellow), a >90% decrease (red) or a >50% increase (green) in CCR5 binding,
when CD4 binding was at least 50% of that seen for the wtΔ protein.
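
The coloring rule in this caption amounts to a small threshold classification. The sketch below is illustrative only and rests on stated assumptions: binding values are expressed as fractions of the reference protein's binding (1.0 means unchanged), the >90% class takes precedence over the >75% class, and the function name is hypothetical.

```python
def classify_ccr5_effect(ccr5_relative, cd4_relative):
    """Return the caption's color class for a mutant, or None if uncolored.

    Both arguments are binding levels relative to the reference protein
    (assumed scale: 1.0 = same as reference, 0.1 = a 90% decrease).
    """
    # Mutants are only considered when CD4 binding is at least 50% of reference.
    if cd4_relative < 0.5:
        return None
    if ccr5_relative < 0.10:   # more than a 90% decrease in CCR5 binding
        return "red"
    if ccr5_relative < 0.25:   # more than a 75% decrease in CCR5 binding
        return "yellow"
    if ccr5_relative > 1.50:   # more than a 50% increase in CCR5 binding
        return "green"
    return None


if __name__ == "__main__":
    print(classify_ccr5_effect(0.05, 0.8))
    print(classify_ccr5_effect(0.20, 0.8))
    print(classify_ccr5_effect(1.60, 1.0))
```

Checking the stricter threshold first keeps the classes mutually exclusive, since any mutant with a >90% decrease also satisfies the >75% criterion.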

Fig. 1C shows the surface depicted in Fig. 1B colored according to the
degree of conservation observed among primate immunodeficiency viruses
(25). Red indicates conservation among all human and simian
immunodeficiency viruses; orange indicates conservation among all HIV-1
isolates, including group O and chimpanzee isolates; yellow indicates
modest variability and green indicates substantial variability among HIV-1
isolates.

Fig. 1D shows the molecular surface of the gp120 glycoprotein,
indicating residues in which changes resulted in a >70% decrease in 17b
antibody binding, in the absence of sCD4.

Fig. 1E shows the molecular surface of the gp120 glycoprotein,
indicating residues in which changes resulted in a >70% decrease in CG10
antibody binding in the presence of sCD4. Residues in which changes
significantly decreased CD4 binding (and thus indirectly decreased CG10
binding) are not shown. Images were made with Midas-Plus (Computer
Graphics Lab, University of California, San Francisco) and GRASP.

Figure 2 shows the molecular surface of the gp120 outer domain
colored according to the variability observed in gp120 residues among
primate immunodeficiency viruses. Red indicates residues conserved
among all primate immunodeficiency viruses; orange, residues conserved in
all HIV-1 isolates; yellow, residues exhibiting some variation among HIV-1
isolates; and green, residues exhibiting significant variability among HIV-1
isolates. The inner gp120 domain is colored red and the outer domain is
colored yellow. The "bridging sheet" is colored orange. The N- and C-termini
of the truncated gp120 core are labeled, as are the positions of structures
related to the gp120 variable regions, V1-V5. The LA, LC, LD and LE
surface loops (12) are shown. The position of the "Phe 43" cavity involved in
CD4 binding is indicated by an asterisk. A gp120 surface implicated in
binding to the CCR5 chemokine receptor is indicated. The variability of the
gp120 surface shown is underestimated since the V4 variable loop, which is
not resolved in the structure, contributes to this surface (approximate
location is indicated). The position of the V5 region is shown. Also note the
highly conserved glycosylation site (asparagine 356 and threonine/serine
358) within the LE loop, between the V5 and V4 regions. In the figure on
the right, the V4 loop and the carbohydrates are modeled, as described in
Materials and Methods. The complex carbohydrate addition sites used in
mammalian cells (14) are colored light blue, and the high-mannose sites are
colored dark blue. The gp120 protein surface is shown in white.

Figures 3A-3D show the spatial relationship of epitopes on the HIV-1
gp120 glycoprotein.

Figure 3A shows the molecular surface of the gp120 core. The
modeled N-terminal gp120 core residues, V4 loop and carbohydrate
structures are included. The variability of the molecular surface is
indicated, using the color scheme described in Figure 2. The modeled
carbohydrates are colored light blue (complex sugars) or dark blue
(high-mannose sugars). The approximate locations of the V2 and V3 variable
loops are indicated. Note the well-conserved surfaces near the "Phe 43"
cavity and the chemokine receptor-binding site.

Figure 3B shows a Cα tracing of the gp120 core. The gp120 residues
within 4 Å of the 17b CD4i antibody are shown in green. The residues
implicated in the binding of CD4BS antibodies (20) are shown in red.
Changes in these residues significantly affect the binding of at least 25
percent of the CD4BS antibodies listed in Table 1. The residues implicated
in 2G12 bindings are shown in blue. The V4 variable loop, which
contributes to the 2G12 epitope (19), is indicated by dotted lines.

Figure 3C shows the molecular surface of the gp120 core, oriented
and colored as in Figure 3B.

Figure 3D shows the approximate locations of the faces of the gp120
core, defined by the interaction of gp120 and antibodies. The molecular
surface accessible to neutralizing ligands (CD4 and CD4BS, CD4i and 2G12
antibodies) is shown in white. The neutralizing face of the complete gp120
glycoprotein includes the V2 and V3 loops, which reside adjacent to the
surface shown. The approximate location of the gp120 face that is poorly
accessible on the assembled envelope glycoprotein trimer and therefore
elicits only non-neutralizing antibodiesS6 is shown in purple. The
approximate location of an immunologically "silent" face of gp120, which
roughly corresponds to the highly glycosylated outer domain surface, is
shown in blue.

Figure 4 is a schematic showing the probable arrangement of the
HIV-1 gp120 glycoproteins in a trimeric complex. The gp120 core was
organized into a trimeric array, based on the criteria discussed in the text.
The perspective is from the target cell membrane, similar to that shown in
Figure 2. The CD4 binding pockets are indicated by black arrows, and the
conserved chemokine receptor-binding regions are colored red. The areas
shaded light green indicate the more variable, glycosylated surfaces of the
gp120 cores. The approximate locations of the 2G12 epitopes are indicated
by blue arrows. The approximate locations for the V3 loops (yellow) and V4
regions (green) are shown. The positions of the V5 regions (green) and some
complex carbohydrate addition sites (asparagines 276, 463, 356, 397 and
406) (blue dots) are shown. The approximate locations of the large V1/V2
loops, centered on the known positions of the V1/V2 stems, are indicated
(green). On one of the gp120 subunits, the positions of the LD and LE loops
are indicated. The distance

DETAILED DESCRIPTION OF THE INVENTION

We have discovered a series of novel polypeptides that can (1)
enhance the immunogenicity of primate lentivirus envelope proteins for
certain conserved epitopes, (2) generate a greater range of antibodies against
"masked" gp120 structures and/or (3) stabilize the three-dimensional
structure of the molecule.

We have discovered regions where disulfide bonds can be inserted
which will stabilize the molecule in a conformation
approximating the native envelope glycoprotein conformation. We have
discovered conserved regions and epitopes that are critical for CD4 and
chemokine receptor binding. We have discovered critical turn structures of
the molecules as well as internal cavities that decrease the immunogenicity
of epitopes that would raise antibodies that could block CD4 binding and/or
chemokine binding.

Preferably, the envelope protein is that of a virus selected from the group
consisting of HIV and SIV. More preferably, it is that of HIV. Still more
preferably, it is HIV-1 gp120.

We have succeeded in growing crystals of gp120 (from the HXBc2
HIV-1 strain) in a ternary complex with two-domain CD4 (D1D2 sCD4) and
the Fab fragment of a CD4i neutralizing antibody, 17b Fab. The crystals
diffracted to a minimum Bragg spacing of at least 2.2 Å, and data have been
collected from cryogenically preserved crystals on the native complex as well
as on isomorphous heavy atom derivatives. While some elements of the
HIV-1 gp120 structure (e.g. the V3 loop) are not supplied by analysis of
these crystals, the vast majority of the gp120 residues are able to be defined
in the structure. Importantly, all of the gp120 residues thought to
contribute to the CD4BS and CD4i neutralization epitopes are defined in the
available structure.

Many of the antibody responses elicited against the HIV-1 envelope
glycoproteins during natural infection of humans are incapable of
neutralizing the virus. Studies of monoclonal antibodies derived from
HIV-1-infected individuals indicate that most of these non-neutralizing
antibodies are directed against elements of the gp120 and gp41
glycoproteins that interact on the assembled oligomer. These elements are
not accessible on the functional envelope glycoprotein spike on the virus
membrane or infected cell surface, thereby rendering the antibodies directed
against them ineffectual at neutralization. The labile association of gp120
and gp41, which exposes and/or creates the epitopes for these
non-neutralizing antibodies, apparently represents an adaptive mechanism for
lentiviruses such as HIV-1 to divert the humoral immune response under
conditions where antigen is limiting.

A corollary is that the gp120 glycoprotein dissociated from the
functional oligomer may have evolved to be less effective at eliciting
neutralizing antibodies directed against conserved gp120 structures. This
corollary appears to be supported by the many attempts to elicit neutralizing
antibodies by gp120 immunogens over the past several years. Dissociation
from gp41 apparently results in an increase in the conformational flexibility
of the gp41-interactive regions of gp120, predisposing the gp120
glycoprotein to elicit non-neutralizing antibodies preferentially over the more
broadly neutralizing antibodies. This conformational flexibility can have two
consequences relevant to selective elicitation of non-neutralizing antibodies:

1) The flexibility and surface exposure of the gp41-interactive C1
and C5 regions on free gp120 can make these structures more
immunogenic; and

2) Conformational flexibility in the C1 and C5 regions, which can mask
many CD4BS epitopes, may disrupt these epitopes and decrease the
efficiency with which CD4BS-directed antibodies are elicited.

Thus, we have found a number of positions where disulfide bonds 
can be introduced to stabilize the polypeptide's structure. This is important 
given the structure of the molecule. 

For example, the gp120 core is composed of an inner domain, an
outer domain, and a "bridging sheet" (Fig. 1A). The "bridging sheet" is a
four-stranded, antiparallel β-sheet that includes the V1/V2 stem and
strands (β20 and β21) derived from the fourth conserved gp120 region. CD4
contacts gp120 residues in the outer domain and the "bridging sheet". The
gp120 residues implicated by our study in CCR5 binding are located near or
within the "bridging sheet" (Figs. 1A and 1B). The "bridging sheet" is
predicted to face the target cell after the envelope glycoproteins bind CD4.
Even more than the CD4-binding site, the gp120 region implicated in CCR5
binding is highly conserved among primate immunodeficiency viruses; this
is particularly apparent in comparison to the remainder of the gp120
surface thought to be exposed on the assembled envelope glycoprotein
complex (See Figs. 1C and 2). The CD4i epitope for the 17b antibody is
located near or within the "bridging sheet", consistent with the ability of the
antibody to block CCR5 binding. All of the individual gp120 residues in
which changes disrupted recognition by the 17b antibody (Fig. 1D) are
located close to the gp120-17b interface in the crystallized complex (Table
1). The binding of another antibody, CG10, which disrupts gp120-CCR5
interaction and competes with the 17b antibody for gp120 binding, is also
affected by changes in amino acid residues within or near the "bridging
sheet" (Fig. 1E). The position and orientation of the V3 base in the
structure, in conjunction with a number of mutagenic and antibody
competition studies, indicate that the gp120 V3 loop resides proximal to
the region implicated in CCR5 binding (Fig. 1A). For example, the binding of
both CG10 and CD4i antibodies to gp120 can be disrupted by some V3
changes. Furthermore, several V3-directed antibodies compete with CD4i
antibodies for gp120 binding.

We have discovered that the CCR5-binding site is likely composed of
conserved gp120 elements near or within the "bridging sheet" and V3 loop
residues. The latter apparently includes more conserved structures (e.g. the
aromatic or hydrophobic residue at position 317), as well as more variable
structures that determine the specific chemokine receptor used. Some of
the gp120 residues identified in this and previous studies as determinants
of chemokine receptor utilization can modulate the interaction of the V3
loop and elements near the "bridging sheet". Studies of HIV-1 revertants
suggested a functional interaction of gp120 residue 440, shown here to
influence CCR5 binding, with the V3 loop.

A subset of the gp120 residues in or near the "bridging sheet"
apparently contacts CCR5 directly. Most of the gp120 residues implicated
in CCR5 binding exhibit reasonable solvent accessibility in the free gp120
core (Table 1). The gp120 surface implicated in CCR5 binding is highly
basic, favoring interactions with the acidic CCR5 amino terminus, which
has been shown to be important for gp120 binding. Additional hydrophobic
interactions, similar to those seen for gp120-17b binding, can also
contribute to the gp120-CCR5 interaction.

The exposure and/or formation of the CCR5-binding site of HIV-1
gp120 glycoproteins is dependent upon interaction with CD4. CD4 binding
has been shown to reposition the V1/V2 variable loops and thus expose the
CD4i epitopes, which overlap the CCR5-binding region. However, since a
gp120 glycoprotein lacking the V1 and V2 variable loops also exhibits
CD4-dependent CCR5 binding, the interaction with CD4 must cause other
conformational changes in gp120 related to the CCR5-binding site. Our
results, which highlight the proximity of the two receptor-binding sites on
gp120, help explain the induction of such conformational changes. First,
one of the components of the "bridging sheet", the V1/V2 stem, also
contacts CD4. Thus, CD4 binding, which appears to distort the V1/V2
stem, may reposition this structure and allow the formation of the β-sheet
important for CCR5 binding. In this respect, a substitution of aspartic acid
for threonine 123, which is located in the V1/V2 stem and contacts CD4,
significantly decreases CCR5 binding. This substitution can disrupt
CD4-induced conformational changes in the V1/V2 stem required for CCR5
binding. Second, the CD4-bound conformation of gp120 exhibits a cavity
(the "Phe 43" cavity) within the gp120 interior. This cavity contacts the
gp120 inner and outer domains as well as the "bridging sheet" and likely
forms as a result of interdomain conformational changes in gp120 induced
by CD4 binding. Since the "bridging sheet" lacks its own hydrophobic core
and is thus dependent upon residues contributed by both inner and outer
domains, any shift in orientation between these domains would alter the
conformation of the "bridging sheet". Furthermore, CD4 binding could also
alter the precise orientation of the "bridging sheet" with respect to the inner
and outer domains, thus aligning the V3 loop and conserved gp120
elements important for CCR5 binding.

CD4 binding induces conformational changes within the "bridging
sheet" as well as between this sheet and the inner and outer domains to
form the high-affinity CCR5 binding site. For some primate
immunodeficiency viruses, the CD4-bound conformation of gp120 must be
energetically accessible in the absence of CD4, which would explain the
documented examples of CD4-independent chemokine receptor binding and
entry.

The CCR5-binding region defined in this study using HIV-1 is also
important for the binding of other primate lentiviruses, such as simian
and human immunodeficiency viruses, to other chemokine receptors. The
identified region exhibits one of the most highly conserved surfaces on the
HIV-1 gp120 glycoprotein, supporting its functional importance for all
primate immunodeficiency viruses. The laboratory-adapted HXBc2 envelope
glycoprotein, which uses CXCR4 and not CCR5 as a coreceptor, can be
converted to an efficient CCR5-using protein simply by substituting the V3
loop of the YU2 virus. Thus, all of the necessary CCR5-binding elements
outside of the V3 loop are conserved, as demonstrated by the substitution
between the divergent HXBc2 and YU2 viruses. Indeed, we have shown that
alteration of lysine 117, lysine 207 and glycine 441 in the HXBc2-YU2V3
chimeric protein also disrupts CCR5 binding. Consistent with the use of
this region for the binding of other chemokine receptors is the observation
that the gp120 changes associated with the conversion of HIV-2 to a
CD4-independent, CXCR4-using virus affect the "bridging sheet" and the V3
loop. Alterations in "bridging sheet" residues have also been implicated in
changes in the tropism of HIV-1 for immortalized cell lines that do not
express CCR5. In addition, the 17b antibody neutralizes HIV-1 strains that
use different chemokine receptors, thereby supporting our finding of the
involvement of a common gp120 region in chemokine receptor interaction.

Chemokine receptor binding can trigger additional conformational
changes in the envelope glycoprotein complex that ultimately lead to the
fusion of the viral and target cell membranes. Some of these changes include
exposure of the ectodomain of the gp41 transmembrane envelope
glycoprotein. The CCR5-binding region defined herein resides close to the
trimer axis of the assembled envelope glycoprotein complex. Indeed, some
of the gp120 residue changes that affect CCR5 binding also affect the
non-covalent association of gp120 and gp41 subunits in the trimeric complex.
This indicates that chemokine receptor binding alters the relationship
between gp120 and gp41, leading to the exposure of the gp41 ectodomain
and interaction with the target cell membrane.

Stabilizing the structure of an envelope protein such as the gp120
glycoprotein should improve the ability of the glycoprotein to elicit desirable
neutralizing antibody responses. This follows from our observation that all
of the conserved HIV-1 gp120 neutralization epitopes span gp120 domains
that exhibit potential flexibility. Stabilization of the gp120 structure can be
achieved by introducing new disulfide bridges at specific locations on the
gp120 chain. This targeted introduction of disulfides is designed to
maintain the molecule in a conformation wherein at least the CD4BS or
CD4i epitopes approximate the wild-type conformation. We expect this
disulfide bonding to preserve the integrity of the relevant neutralization
epitopes.

The disulfide bonds can be introduced at a number of different amino
acid residues in the gp120 structure. The only precaution is not to replace
an amino acid residue critical for the generation of an antibody to a
conserved epitope. A number of these epitopes are set forth in the tables
generated by the binding assay. Residue pairs that can be used include
Pro118-Ala433 (using HXBc2 numbering), Leu122-Gly431, Phe210-Gly380, and
Ser256-Phe376. The respective amino acid residues in other strains can
readily be derived by standard means such as aligning the amino acid
sequences with any standard computer homology program under the default
settings. Such programs include, but are not limited to, BLAST 2.0, such as
BLAST 2.0.4 and 2.0.5, available from the NIH (see
www.ncbi.nlm.nih.gov/BLAST/newblast.html) (Altschul, S.F., et al.,
Nucleic Acids Res. 25:3389-3402 (1997)), and DNASIS (Hitachi Software
Engineering America, Ltd.). Preferably one inserts disulfide bonds at one
of Pro-Ala or Leu-Gly and one of Phe-Gly or Ser-Phe.
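
As a minimal sketch of how HXBc2 numbering might be carried over to another
strain once an alignment is in hand, the Python function below walks two
gapped, pre-aligned sequences and reports which residue of the second
sequence sits opposite a given reference position. The function name, the
"-" gap character and the toy sequences are assumptions made for
illustration; the alignment itself would come from a program such as BLAST.

```python
def map_reference_position(aligned_ref, aligned_target, ref_pos,
                           ref_start=1, target_start=1):
    """Map a reference residue number onto an aligned target sequence.

    aligned_ref / aligned_target: equal-length strings with '-' for gaps.
    ref_start / target_start: residue number of the first non-gap residue
    in each aligned segment (hypothetical parameters for this sketch).
    Returns (target_position, target_residue), or None when the reference
    residue is aligned to a gap in the target.
    """
    if len(aligned_ref) != len(aligned_target):
        raise ValueError("aligned sequences must have equal length")

    ref_number = ref_start - 1
    target_number = target_start - 1
    for ref_char, target_char in zip(aligned_ref, aligned_target):
        # Advance each counter only on real residues, never on gaps.
        if ref_char != "-":
            ref_number += 1
        if target_char != "-":
            target_number += 1
        if ref_char != "-" and ref_number == ref_pos:
            if target_char == "-":
                return None
            return target_number, target_char
    raise ValueError("ref_pos lies outside the aligned reference segment")


if __name__ == "__main__":
    # Toy alignment: the target carries a two-residue insertion, so
    # reference residue 5 corresponds to target residue 7.
    print(map_reference_position("ACDE--FGH", "ACDEKKFGH", 5))
```

A mapped residue should still be checked against the criteria for cysteine
substitution in the target strain, since an aligned position need not be
structurally equivalent.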

In addition, other residues that can be used can be determined based 
upon the following criteria: 

1) The two residues targeted for cysteine substitution are distant
on the gp120 linear sequence, thus increasing the entropic benefit of the
cysteine bridge (see below);

2) The Cα atoms of the selected residues are within 6 Å of one
another and the Cβ atoms within 4 Å of each other, in the native gp120
structure;

3) Neither of the selected residues is proximal in the structural
model to naturally occurring gp120 cysteines, nor do natural disulfide
bonds already link the targeted gp120 strands;

4) The substituted residues, as aforesaid, do not make major
contributions to the binding of desired neutralizing antibodies;

5) If internal residues are chosen, both residues are involved in
mutual packing interactions.
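
The first two criteria lend themselves to a simple geometric screen. The
Python sketch below assumes that Cα and Cβ coordinates (in Å) have already
been extracted from the structure. The dictionary layout, the function name,
the coordinates used in the example and the 50-residue sequence-separation
threshold are illustrative assumptions, not values from the text. Criteria 3
to 5 call for structural and antibody-binding judgment and are not modeled.

```python
import math

CA_CUTOFF = 6.0        # Å, Cα-Cα limit from criterion 2
CB_CUTOFF = 4.0        # Å, Cβ-Cβ limit from criterion 2
MIN_SEPARATION = 50    # residues; assumed meaning of "distant" (criterion 1)


def is_candidate_pair(res_a, res_b, min_separation=MIN_SEPARATION):
    """Screen one residue pair against criteria 1 and 2.

    Each residue is a dict with keys "num" (sequence number), "ca" and "cb"
    (x, y, z tuples in Å). Glycine has no Cβ, so a modeled Cβ position
    would have to be supplied for it (an assumption of this sketch).
    """
    far_in_sequence = abs(res_a["num"] - res_b["num"]) >= min_separation
    ca_close = math.dist(res_a["ca"], res_b["ca"]) <= CA_CUTOFF
    cb_close = math.dist(res_a["cb"], res_b["cb"]) <= CB_CUTOFF
    return far_in_sequence and ca_close and cb_close


if __name__ == "__main__":
    # Hypothetical coordinates, chosen only to exercise the checks.
    first = {"num": 118, "ca": (0.0, 0.0, 0.0), "cb": (1.0, 1.0, 0.0)}
    second = {"num": 433, "ca": (5.0, 0.0, 0.0), "cb": (3.5, 1.0, 0.0)}
    print(is_candidate_pair(first, second))
```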

Adherence to these criteria should optimize the opportunity to
generate well-folded gp120 glycoprotein derivatives in which the natural
disulfide bonds form and the introduced cysteines create an additional novel
disulfide bond. Within a 6 Å inter-Cα distance, the possibility for either cis-
or trans-disulfide bond formation allows considerable flexibility in
interatomic distances.

These choices can be further confirmed by taking the overall energy
considerations into account. For example, theoretical and empirical studies
of the effects of added covalent cross-links on the folded state have been
conducted on model proteins (see, Hazes, et al., Protein Eng. 2:119-125
(1988); Muskal et al., Protein Eng. 3:667-672 (1990); Reiter, et al., Protein
Eng. 8:1323-1331 (1995); Sowdhamini, et al., Protein Eng. 3:95-103 (1989);
Zhou, et al., Biochem. 32:3178-3187 (1993); Johnson, et al., Biochem.
17:1479-1483 (1978); and Pace, et al., J. Biol. Chem. 263:11820-11825
(1988)). Most proteins can be modeled as existing in two states, native and
unfolded, the ratio of which at any given temperature, pH and salt
concentration can be specified by an equilibrium constant Kf (see, Kyte, et
al., Structure in Protein Chemistry pp. 445-466 (1995)). The equilibrium
constant of folding (Kf) is related to the standard free energy of folding
(ΔG°f) by the equation ΔG°f = -RT ln Kf, where R is the gas constant and T
is the temperature. The ΔG°f value is primarily the sum of the favorable
enthalpic contribution of removal of hydrophobic amino acids from contact
with the aqueous environment and the unfavorable loss of configurational
entropy of the unfolded, random coil. Under physiologic conditions, the ΔG°f
value for most proteins is slightly negative (-30 to -60 kJ/mole), thus
favoring the native conformation.

The introduction of disulfide or other covalent bonds cross-linking
strands of a protein has been demonstrated to stabilize the native state of
the protein, lowering the ΔG°f value. Since proteins must be already folded
to allow cysteines that are adjacent in the native structure to form a
disulfide bond, and cysteine bridges per se contribute little to enthalpic
changes favorable to folding, the vast majority of the stabilizing effect of
disulfide bonds on the native state derives from a decrease in the
configurational entropy of the unfolded protein. A practical consequence of
this is that the greater the distance in the linear amino acid sequence
between the two cysteines that are cross-linked, the greater the magnitude
of the stabilizing effect on the native conformation. These theoretical
considerations have been supported by experiments introducing cross-links
into proteins at various positions and determining the resulting Kf and ΔG°f
values. The decreases in ΔG°f associated with cross-linking in these
experiments were on the order of -20 kJ/mole, which can exert considerable
effects on stabilization of native structure (considering that the difference
between unfolded and native states is typically only -30 to -60 kJ/mole).
Since the existing intrachain disulfide bridges in the HIV-1 gp120
glycoprotein only minimally constrain the potential conformations available
to the denatured protein, a significant benefit should accrue by introducing
additional, properly positioned cross-links.
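
To give a feel for these magnitudes, the short Python sketch below applies
the standard two-state relation ΔG°f = -RT ln Kf to convert a folding free
energy into an equilibrium constant, and a change in free energy into a fold
change in Kf. The function names and the default temperature of 310.15 K
(37 °C) are assumptions made for illustration.

```python
import math

R_KJ = 8.314e-3  # gas constant in kJ/(mol*K)


def folding_constant(delta_g_kj, temperature_k=310.15):
    """Kf = [native]/[unfolded] for a folding free energy in kJ/mole."""
    return math.exp(-delta_g_kj / (R_KJ * temperature_k))


def stabilization_factor(delta_delta_g_kj, temperature_k=310.15):
    """Fold change in Kf caused by a change in folding free energy.

    A negative delta_delta_g_kj (stabilization) gives a factor above 1.
    """
    return math.exp(-delta_delta_g_kj / (R_KJ * temperature_k))


if __name__ == "__main__":
    # Range quoted in the text for typical proteins.
    for delta_g in (-30.0, -60.0):
        print(delta_g, folding_constant(delta_g))
    # Effect of a cross-link contributing about -20 kJ/mole.
    print(stabilization_factor(-20.0))
```

Under these assumptions a -20 kJ/mole contribution multiplies Kf by a factor
on the order of 10^3, which is why a single well-placed cross-link can matter
for a protein whose total folding free energy is only -30 to -60 kJ/mole.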

The gp120 glycoprotein exists in three domains, and the presence of cavities
wedged between these domains offers the possibility of interdomain
flexibility. Since the conserved neutralization epitopes on gp120 span two
domains, such flexibility can render the protein incapable of efficiently
eliciting these kinds of desirable antibodies. The selective
introduction of hydrophobic amino acid residues in the modified envelope
protein can enhance immunogenicity as discussed below.

The disulfide-stabilized mutants can be created by the site-directed
mutagenesis of a plasmid designed to express the soluble HIV-1 gp120
glycoprotein in the supernatants of Drosophila cells by known means. For
example, 89.6 and YU2 gp120 or any other gp120 glycoproteins can be
used. Cell supernatants can be examined for the production of properly
folded gp120 glycoproteins, using a pool of sera from HIV-1-infected
humans, which will recognize even misfolded gp120 molecules, and a panel
of conformation-dependent anti-gp120 monoclonal antibodies. Properly
folded proteins with desirable epitopes intact will be purified by
immunoaffinity chromatography using a CD4BS-directed antibody (F105)
column.

Several methods are available to document the formation of, for
example, the desired disulfide bond in the gp120 glycoprotein. Chemical
methods allow an estimate of the percentage of the proteins in a given
preparation that form the disulfide bond. For example, ethylenimine reacts
with cysteine under mild conditions to form an S-(β-aminoethyl)-cysteine
derivative, which can be detected in protein hydrolysates by
chromatographic analysis. The presence of these derivatives indicates that
at least some of the cysteines in the protein are free, and the percentage of
these unpaired cysteines can be estimated by using other methods that do
not distinguish cysteine from cystine (e.g., ethylenimine in conjunction with
a reducing agent, or performic acid oxidation (Rafferty, Biochem. Biophys.
Res. Commun. 10:467 (1963); and Moore, S., J. Biol. Chem. 238:235 (1963))).
Analysis of proteolytic fragments of the wild-type and mutant glycoproteins
is a second approach capable of documenting the formation of the desired
disulfide bond. The latter method can be used in conjunction with
monoclonal antibodies directed against specific linear peptides of gp120 to
verify that peptides in the vicinity of the putative disulfide bond exhibit
altered behavior upon proteolysis of wild-type and mutant glycoproteins.

The formation of an additional disulfide bond bridging linearly distant
gp120 regions that are not already constrained by existing disulfide bonds
should result in a significant effect on Kf and ΔG°f. Since under
physiological conditions most proteins are stably folded in their native
state, estimates of Kf and ΔG°f are typically made under conditions of low
pH, higher temperature and/or the presence of urea or guanidinium
chloride. Since the protein folding reaction must occur reversibly to obtain
estimates of Kf or ΔG°f, the test should avoid the use of high temperatures
that often lead to irreversible changes in proteins. Instead, the denaturation
of the wild-type and cross-linked mutant gp120 glycoproteins should be
compared over a range of chaotropic salt concentrations and pH values. A
number of physical properties of proteins have been used to monitor protein
folding, including intrinsic viscosity, optical rotation, molar ellipticity,
ultraviolet light absorption, electrophoretic mobility and sedimentation
velocity. Absorption of ultraviolet light can be studied for the wild-type and
mutant gp120 glycoproteins produced in Drosophila cells, since this
parameter is easily measured and reliably detects changes in protein
folding. Assuming that only two states of gp120, native and denatured,
exist, Kf and ΔG°f can be determined for each concentration of guanidinium
chloride, temperature and pH directly from the absorbance versus salt/pH
curves. Typically, Kf and ΔG°f values obtained under these varying
conditions are used to extrapolate to physiologic salt and pH values,
although the stabilizing effect of the introduced disulfide should be evident
over a wide range of pH and chaotropic salt concentrations.
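
The two-state analysis described above can be sketched in a few lines of
Python. The sketch assumes that the absorbance of the fully native and fully
denatured protein is known at each condition, computes Kf as the ratio of
native to denatured fractions, and then extrapolates ΔG°f to zero denaturant
with an ordinary least-squares line. The linear extrapolation is a common
convention adopted here as an assumption; the text does not specify how the
extrapolation is done. All function names, the default temperature and the
numerical values are hypothetical.

```python
import math

R_KJ = 8.314e-3  # gas constant in kJ/(mol*K)


def folding_free_energy(a_obs, a_native, a_unfolded, temperature_k=298.15):
    """Two-state folding free energy (kJ/mole) from one absorbance reading.

    Kf = fraction native / fraction unfolded
       = (a_obs - a_unfolded) / (a_native - a_obs)
    """
    k_f = (a_obs - a_unfolded) / (a_native - a_obs)
    if k_f <= 0:
        raise ValueError("reading must lie between the two baselines")
    return -R_KJ * temperature_k * math.log(k_f)


def extrapolate_to_zero(concentrations, free_energies):
    """Least-squares line through (concentration, free energy) points.

    Returns (intercept, slope); the intercept estimates the folding free
    energy in the absence of denaturant.
    """
    n = len(concentrations)
    mean_c = sum(concentrations) / n
    mean_g = sum(free_energies) / n
    covariance = sum((c - mean_c) * (g - mean_g)
                     for c, g in zip(concentrations, free_energies))
    variance = sum((c - mean_c) ** 2 for c in concentrations)
    slope = covariance / variance
    return mean_g - slope * mean_c, slope


if __name__ == "__main__":
    # Hypothetical transition-region data: denaturant (M) and free energy.
    print(extrapolate_to_zero([2.0, 3.0, 4.0], [-10.0, -5.0, 0.0]))
```

Running the same analysis on the wild-type and cross-linked glycoproteins
and comparing the two intercepts gives one way to express the stabilizing
effect of the introduced disulfide.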

As mentioned above, one can alternatively introduce Pro at
defined turn structures, for example at Ile423. These changes can readily
be made and tested, specifically to see that the integrity of relevant
neutralization epitopes is retained.

To enhance the ability to generate antibodies one can increase the
hydrophobicity of various cavities in the molecule. The presence of cavities
in the CD4-bound gp120 structure probably reflects interdomain flexibility
in the non-CD4-bound portion. The interdomain flexibility could decrease
the integrity of CD4BS and CD4i epitopes, and other conserved structures.
One way of dealing with this problem is to increase the hydrophobic
residues in the cavity. Hydrophobic residues are well-known and include
Trp, Phe, Leu, and Ile. One can change some of the non-hydrophobic
residues into hydrophobic residues, or increase the hydrophobicity of
already hydrophobic residues. An increase in the size of the side chain can
be tolerated, depending on the volume of the cavity to be filled. The changes
can be made by site-directed mutagenesis or other known means. The
changes can be tested for their effect on antibody binding by using a panel
of known antibodies that bind to a desired epitope, e.g., using CD4BS
epitopes. Examples of the changes that can be made include Ser375->Trp,
Val255->Trp, Arg273->Trp, Ser481->Phe, and Ser447->Ile. Preferably, at
least one of the amino acid residues in the cavity is changed, i.e., it is
Trp or Phe or Ile, instead of the wild-type residue.

The recessed nature of the CD4 binding pocket may delay the
generation of high affinity antibodies against the CD4BS epitopes and can
afford opportunities to minimize the antiviral efficacy of such antibodies
once they are elicited. The degree of recession is believed to be even greater
on the full length glycosylated gp120 than is evident on the crystallized
gp120 core. The recessed pocket is flanked on one side by the V1/V2 stem
loop structure. The V2 loop apparently folds back along the V1/V2 stem
with V2 residues 183-188 proximal to Asp 368 and Glu 370. This can
enhance masking of the adjacent CD4BS and CD4i gp120 epitopes and
divert antibody responses toward the variable loops. This may be dealt with
by using gp120 polypeptides where at least a portion of the variable loop has
been deleted as described in U.S. Patent No. 5,817,315.

Still more preferably, more than one of the amino acid residues have 
these changes. 

One can also increase the hydrophobicity across the interface
between the gp120 domains. Hydrophobic residues that fill the interdomain
cavities will decrease interdomain flexibility.

Thus, one should increase the generation of antibodies to the
conserved receptor-binding regions, and can enhance immunogenicity or raise
a greater number of antibodies to these desired sites than the wild-type
protein does. This can be done by having the polypeptide contain a
hydrophobic residue at certain interface sites instead of other residues, for
example Leu, Ile, Trp, etc., such as Leu instead of Asn377, Ile
instead of Thr283, and/or Leu instead of Asp477. The key to the
substitution is to preserve the conformational integrity of the desired
neutralization epitope, while at the same time filling the interdomain
cavities.

The integrity of relevant neutralization epitopes on an envelope
glycoprotein such as gp120 can be verified with a panel of monoclonal
antibodies, as described above. Purified mutant proteins that exhibit
formation of, e.g., the desired disulfide bond and increased stability of a
native conformation can be used to immunize mice, in parallel with the
wild-type gp120 as a control.

The polypeptides of this invention can be used to generate a range of
antibodies to gp120. For example, antibodies that affect the interaction with
the binding site can be directly screened using a direct binding
assay. For example, one can label (e.g., with a radioactive or fluorescent
label) a gp120 protein or derivative and add soluble CD4. There are various
soluble CD4s known in the art including a two-domain (D1D2 sCD4) and a
four-domain version. The labeled gp120, or derivative, e.g., a
conformationally intact deletion mutant such as one lacking portions of the
variable loops (e.g. V1/V2) and in some instances constant regions, and
soluble CD4 can be added to medium containing a cell line expressing a
chemokine receptor that the antibody will block binding to. In this example,
the derivative will block binding to CCR5. Alternatively, when using a
derivative from a T cell tropic gp120 one would use a cell line that
expresses CXCR4. Binding can then be directly measured. The antibody of
interest can be added before or after the addition of the labeled gp120 or
derivative and the effect of the antibody on binding can be determined by
comparing the degree of binding in that situation against a base line
standard with that gp120 or derivative, not in the presence of the antibody.
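
The comparison against the base line standard can be expressed as a percent
inhibition. The Python sketch below is one simple way to do so, assuming
that bound label is measured in arbitrary but consistent units (for example
counts per minute) and that an optional non-specific background reading is
subtracted from both measurements. The function name, the background
correction and the numbers in the example are assumptions for illustration.

```python
def percent_inhibition(bound_with_antibody, bound_baseline, background=0.0):
    """Percent reduction in labeled gp120 binding caused by an antibody.

    bound_baseline is the binding measured without antibody; background is
    an optional non-specific signal subtracted from both readings.
    """
    specific_baseline = bound_baseline - background
    if specific_baseline <= 0:
        raise ValueError("baseline binding must exceed background")
    specific_with_antibody = bound_with_antibody - background
    return 100.0 * (1.0 - specific_with_antibody / specific_baseline)


if __name__ == "__main__":
    # Hypothetical readings: 10000 units without antibody, 2500 with it.
    print(percent_inhibition(2500.0, 10000.0))
```

A value near zero indicates that the antibody has little effect on binding,
while a value near 100 indicates nearly complete blocking.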

A preferred assay uses the labeled gp120, or derivative portion, for
example a gp120 protein derived from an M-tropic strain such as JR-FL,
iodinated using for instance solid phase lactoperoxidase (in one example
having a specific activity of 20 µCi/µg). The cell line containing the
chemokine receptor in this example would be a CCR5 cell line, e.g. L1.2 or
membranes thereof. Soluble CD4 would be present.

In one embodiment, the conformational envelope polypeptide, such as
gp120, should contain a sufficient number of amino acid residues to define
the binding site of the gp120 to the chemokine receptor (e.g. typically from
the V3 loop) and a sufficient number of amino acids to maintain the
conformation of the peptide in a conformation that approximates that of
wild-type gp120 bound to soluble CD4 with respect to the chemokine
receptor binding site. Preferably, the V1/V2 loops are deleted. In other
embodiments at least portions of the V3 loop can be removed to remove
masking amino acid residues. In order to maintain the conformation of the
polypeptide one can insert linker residues that permit potential turns in the
polypeptide's structure, for example amino acid residues such as Gly, Pro
and Ala. Gly is preferred. Preferably, the linker is as small as
necessary to maintain the overall configuration. It should typically be
smaller than the number of amino acids in the variable region being deleted.
Preferably, the linker is 8 amino acid residues or less, more preferably 7
amino acid residues or less. Even more preferably, the linker sequence is 4
amino acid residues or less. In one preferred embodiment the linker
sequence is one residue. Preferably, the linker residue is Gly.

In one preferred embodiment, the gp120 also contains a CD4 binding
site (e.g. from the C3 region residues 368 and 370, and from the C4 region
residues 427 and 457). The chemokine binding site is a discontinuous
binding site that includes portions of the C2, C3, C4 and V3 regions. By
deletion of non-essential portions of the gp120 polypeptide, such as
deletions of portions of non-essential variable regions (e.g. V1/V2) or
portions in the constant regions (e.g. C1, C5), one can increase exposure of
the CD4 binding site. Another embodiment is directed to a gp120 portion
containing a chemokine binding site. Similarly, by deleting the
non-essential portions of the protein one can increase exposure of the
chemokine binding site. The increased exposure enhances the ability to
generate an antibody to the CD4 receptor or chemokine receptor, thereby
inhibiting viral entry. Removal of these regions is done while requiring the
derivative to retain an overall conformation approximating that of the
wild-type protein with respect to the native gp120 binding region, e.g. the
chemokine binding region when complexed to CD4. In addition, one can
remove glycosylation sites that are dispensable for proper folding (see Wyatt
et al., U.S. provisional application no. EL014417278US, filed June 17,
1998). Maintaining conformation can be accomplished by using the
above-described linker residues that permit potential turns in the structure
of the gp120 derivative to maintain the overall three-dimensional structure.
Preferred amino acid residues that can be used as linker include Gly and
Pro. Other amino acids can also be used as part of the linker, e.g. Ala.
Examples of how to prepare such peptides are described more fully in
Wyatt, R., et al., J. of Virol. 69:5723-5733 (1995); Thali, M., et al., J. of
Virol. 67:3978-3988 (1993); and U.S. Patent No. 5,817,316, issued October 6,
1998, which are incorporated herein by reference. See for example Wyatt,
which teaches how to prepare V1/V2 deletions that retain the stem portion
of the loop.
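
The linker preferences stated above (Gly, Pro or Ala residues, 8 residues or fewer, and shorter than the region being deleted) can be illustrated at the sequence level. This is a rough sketch under stated assumptions: the function name and the toy sequence are hypothetical, the restriction to three residue types is a simplification, and nothing here predicts whether a real derivative keeps its conformation, which still has to be verified by the binding assays.

```python
# Simplification: only the three residues named in the text are accepted,
# although the text allows other amino acids as part of the linker.
ALLOWED_LINKER_RESIDUES = set("GPA")  # Gly, Pro, Ala
MAX_LINKER_LENGTH = 8


def replace_region_with_linker(sequence, start, end, linker="G"):
    """Replace sequence[start:end] (0-based, end exclusive) with a linker.

    Enforces the stated preferences: Gly/Pro/Ala only, at most 8 residues,
    and shorter than the deleted region. The default is a single Gly.
    """
    deleted = sequence[start:end]
    if not deleted:
        raise ValueError("deleted region is empty")
    if not linker or set(linker) - ALLOWED_LINKER_RESIDUES:
        raise ValueError("linker must consist of Gly, Pro or Ala residues")
    if len(linker) > MAX_LINKER_LENGTH:
        raise ValueError("linker should be 8 residues or fewer")
    if len(linker) >= len(deleted):
        raise ValueError("linker should be shorter than the deleted region")
    return sequence[:start] + linker + sequence[end:]


# Toy sequence, not a real gp120 fragment: the two flanking cysteines stand
# in for the retained stem, and the residues between them for the loop.
toy = "MRVKC" + "TDLKNDTNTNS" + "CSFN"
print(replace_region_with_linker(toy, 5, 16))
```

Keeping the checks inside the function means that a linker longer than the deleted loop is rejected before any construct is designed around it.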

In one embodiment the gp120 derivative is designed to be
permanently attached at the CD4 binding site to sufficient domains of CD4
to create a conformation of the chemokine binding site approximating that
of the native gp120/CD4 complex.

An alternative gp120 derivative is one wherein the linkers used
result in a conformation for the derivative so that the discontinuous binding
site or a discontinuous epitope such as CD4BS or CD4i with the chemokine
receptor approximates the conformation of the discontinuous binding site
for the chemokine receptor in the wild-type gp120/CD4 complex. These
derivatives can readily be made by the person of ordinary skill in the art
based upon the above-described methodologies and screened in the assays
shown herein to ensure that proper binding is obtained.

The gp120 polypeptide can also be bound to at least a portion of a
gp41 polypeptide, namely the coiled coil. Some of these derivatives will lack
the gp41 transmembrane region and will therefore be made as secreted,
soluble oligomers. Examples include gp41 portions lacking the transmembrane
region but retaining the cytoplasmic region, and others truncated beginning
with the transmembrane region. The gp41 polypeptide may contain additional
cysteine residues, which result in the formation of disulfide bonds between
the monomers, thereby stabilizing the complex as a trimer having spikes
similar to that found in the wild type (as in U.S. Application Serial No.
09/164,880).

These immunogenic oligomers can be used to generate an immune
reaction in a host by standard means. For example, one can administer the
polypeptide in adjuvant. In another approach, a DNA sequence encoding
the envelope protein, e.g., the one based upon gp120, can be administered
by standard techniques. The approach of administering the protein is
presently preferred.

The protein is preferably administered with an adjuvant. Adjuvants
are well known in the art and include aluminum hydroxide, Ribi adjuvant,
etc. The administered protein is typically an isolated and purified protein.
The protein is preferably purified to at least 95% purity, more preferably at
least 98% purity, and still more preferably at least 99% purity. Methods of
purification while retaining the conformation of the protein are known in the
art. The purified protein is preferably present in a pharmaceutical
composition with a pharmaceutically acceptable carrier or diluent present.

DNA sequences encoding these proteins can readily be made. For
example, one can use the native gp160 (or a derivatized gp120 portion) of
any of a range of primate lentiviruses, such as HIV-1 strains, which are well
known in the art and can be modified by known techniques, such as deleting
the undesired regions (such as variable loops) and inserting desired coding
sequences such as cysteines and linker segments. In addition to DNA
sequences based upon existing strains, the codons for the various amino
acid residues are known and one can readily prepare alternative coding
sequences by standard techniques.

DNA sequences can be used in a range of animals to express the
monomer, which then forms into the trimer and generates an immune
reaction.

DNA sequences can be administered to a host animal by numerous
methods including vectors such as viral vectors, naked DNA,
adjuvant-assisted DNA, catheters, gene gun, liposomes, etc. In one preferred
embodiment the DNA sequence is administered to a human host as either a
prophylactic or therapeutic treatment to stimulate an immune response,
most preferably as a prophylactic. One can administer cocktails containing
multiple DNA sequences encoding a range of HIV env strains.

Vectors include chemical conjugates such as described in WO
93/04701, which has a targeting moiety (e.g. a ligand to a cellular surface
receptor) and a nucleic acid binding moiety (e.g. polylysine), viral vectors
(e.g. a DNA or RNA viral vector), fusion proteins such as described in
PCT/US 95/02140 (WO 95/22618), which is a fusion protein containing a
target moiety (e.g. an antibody specific for a target cell) and a nucleic acid
binding moiety (e.g. a protamine), plasmids, phage, etc. The vectors can be
chromosomal, non-chromosomal or synthetic.

Preferred vectors include viral vectors, fusion proteins and chemical
conjugates. Retroviral vectors include Moloney murine leukemia viruses
and HIV-based viruses. One preferred HIV-based viral vector comprises at
least two vectors wherein the gag and pol genes are from an HIV genome
and the env gene is from another virus. DNA viral vectors are preferred.
These vectors include herpes virus vectors such as a herpes simplex I virus
(HSV) vector [Geller, A.I. et al., J. Neurochem. 64: 487 (1995); Lim, F. et
al., in DNA Cloning: Mammalian Systems, D. Glover, Ed. (Oxford Univ. Press,
Oxford, England) (1995); Geller, A.I. et al., Proc. Natl. Acad. Sci. U.S.A.
90: 7603 (1993); Geller, A.I. et al., Proc. Natl. Acad. Sci. U.S.A. 87: 1149
(1990)], adenovirus vectors [LeGal LaSalle et al., Science 259: 988 (1993);
Davidson et al., Nat. Genet. 3: 219 (1993); Yang et al., J. Virol. 69: 2004
(1995)] and adeno-associated virus vectors [Kaplitt, M.G. et al., Nat. Genet.
8: 148 (1994)].

The DNA sequence can be operably linked to a promoter that would
permit expression in the host cell. Such promoters are well known in the
art and can readily be selected. For example, when expression in a
mammalian host is desired, a promoter that results in high levels of
expression in such host cells is used. Appropriate polyadenylation
sequences are also known and can be selected. Representative examples of
such promoters include a retroviral LTR or SV40 promoter, the E. coli lac
or trp, the phage lambda P[L] promoter and other promoters known to
control expression of genes in prokaryotic or eukaryotic cells or their
viruses. The expression vector also contains a ribosome binding site for
translation initiation and a transcription terminator. The vector may also
include appropriate sequences for amplifying expression.

Promoter regions can be selected from any desired gene using CAT
(chloramphenicol acetyltransferase) vectors or other vectors with selectable
markers. Two appropriate vectors are pKK232-8 and pCM7. Particular
named bacterial promoters include lacI, lacZ, T3, T7, gpt, lambda P[R], P[L]
and trp. Eukaryotic promoters include CMV immediate early, HSV
thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse
metallothionein-I. Selection of the appropriate vector and promoter is well
within the level of ordinary skill in the art.

In a further embodiment, the present invention relates to host cells
containing the above-described constructs. The host cell can be a higher
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such
as a yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial
cell. Introduction of the construct into the host cell can be effected by a
variety of methods including calcium phosphate transfection, DEAE-Dextran
mediated transfection, or electroporation (Davis, L., Dibner, M.,
Battey, I., Basic Methods in Molecular Biology, (1986)).

Stabilized forms of these complexes can readily be made, for example,
by conjugates such as a poly(alkylene oxide) conjugate. The conjugate is
preferably formed by covalently bonding the hydroxyl terminals of the
poly(alkylene oxide) and a free amino group in the gp120 portion that will
not affect the conformation of the discontinuous binding site. Other
art-recognized methods of conjugating these materials include amide or ester
linkages. Covalent linkage as well as non-covalent conjugation such as
lipophilic or hydrophilic interactions can be used.

The conjugate can be comprised of non-antigenic polymeric
substances such as dextran, polyvinyl pyrrolidones, polysaccharides,
starches, polyvinyl alcohols, polyacrylamides or other similar substantially
non-immunogenic polymers. Polyethylene glycol (PEG) is preferred. Other
poly(alkylene oxides) include monomethoxy-polyethylene glycol,
polypropylene glycol, block copolymers of polyethylene glycol and
polypropylene glycol, and the like. The polymers can also be distally capped
with C1-4 alkyls instead of monomethoxy groups. The poly(alkylene oxides)
used must be soluble in liquid at room temperature. Thus, they preferably
have a molecular weight from about 200 to about 20,000 daltons, more
preferably about 2,000 to about 10,000 and still more preferably about
5,000.

One can administer these stabilized compounds to individuals by a
variety of means. For example, these antibodies can be included in vaginal
foams or gels that are used as preventives to avoid infection and applied
before people have sexual contact.

The peptides or antibodies when used for administration are prepared
under aseptic conditions with a pharmaceutically acceptable carrier or
diluent.

Doses of the pharmaceutical compositions will vary depending upon
the subject and upon the particular route of administration used. Dosages
can range from 0.1 to 100,000 µg/kg a day, more preferably 1 to
10,000 µg/kg.
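
Because the ranges above are stated per kilogram, the corresponding absolute daily amounts scale with the weight of the subject. The sketch below only performs that arithmetic: the function name, the tier labels and the 70 kg example weight are assumptions for illustration, and it is not a dosing recommendation.

```python
# Per-kilogram daily ranges as stated in the text (micrograms per kg per day).
DOSE_RANGES_UG_PER_KG = {
    "broad": (0.1, 100_000.0),
    "preferred": (1.0, 10_000.0),
}


def daily_dose_bounds_ug(weight_kg, tier="preferred"):
    """Return (low, high) daily amounts in micrograms for a given body weight."""
    if weight_kg <= 0:
        raise ValueError("weight must be positive")
    low, high = DOSE_RANGES_UG_PER_KG[tier]
    return low * weight_kg, high * weight_kg


# Hypothetical 70 kg subject.
low, high = daily_dose_bounds_ug(70.0)
print(f"preferred range: {low:.0f} to {high:.0f} micrograms per day")
```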

Routes of administration include oral, parenteral, rectal, intravaginal,
topical, nasal, ophthalmic, direct injection, etc.

Changes in the viral envelope glycoproteins, in particular in the third
variable (V3) region of the gp120 exterior envelope glycoprotein, determine
tropism-related phenotypes (Cheng-Mayer et al., 1990; O'Brien et al., 1990;
Hwang et al., Westervelt et al., 1992; Chesebro et al., 1992; Willey et al.,
1994). Amino acid changes in the V3 region (Helseth et al., 1990; Freed et
al., 1991; Ivanoff et al., 1991; Bergeron et al., 1992; Grimaila et al., 1992;
Page et al., 1992; Travis et al., 1992) and the binding of antibodies to this
domain (Putney et al., 1986; Goudsmit et al., 1988; Linsley et al., 1988;
Rusche et al., 1988; Skinner et al., Javeherian et al., 1989) have been shown
to disrupt a virus entry process other than CD4 binding. Accordingly, one
can create derivatives and change the phenotype for a particular receptor by
substituting V3 loops.

One can inhibit infection by directly blocking receptor binding. This
can be accomplished by a range of different approaches, for example,
antibodies. One preferred approach is the use of antibodies to the binding
site for these chemokine receptors. Antibodies to these receptors can be
prepared by standard means using the stable immunogenic oligomers. For
example, one can use single chain antibodies to target these binding sites.

As used herein the inhibition of HIV infection means that as
compared to a control situation infection is reduced, inhibited or prevented.
Infection is preferably at least 20% less, more preferably at least 40% less,
even more preferably at least 50% less, still more preferably at least 75%
less, even more preferably at least 80% less, and yet more preferably at least
90% less than the control.
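
The graded thresholds in this definition can be checked mechanically once an infection readout is available for the control and the treated condition. The following is a minimal sketch, assuming a single numeric readout per condition (for example a reporter signal); the function names and example values are hypothetical.

```python
# Reduction thresholds (percent below control) listed in the definition above.
THRESHOLDS = (20, 40, 50, 75, 80, 90)


def percent_reduction(control, treated):
    """Percent by which the treated readout falls below the control readout."""
    if control <= 0:
        raise ValueError("control readout must be positive")
    return 100.0 * (control - treated) / control


def highest_threshold_met(control, treated):
    """Largest listed threshold satisfied, or None if the reduction is under 20%."""
    reduction = percent_reduction(control, treated)
    met = [t for t in THRESHOLDS if reduction >= t]
    return met[-1] if met else None


# Hypothetical readouts, not data from this document.
print(highest_threshold_met(200.0, 40.0))
```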

One preferred use of the antibodies is to minimize the risk of HIV
transmission. These antibodies can be included in ointments, foams, or
creams that can be used during sex. For example, they can be administered
preferably prior to or just after sexual contact such as intercourse. One
preferred composition would be a vaginal foam containing one of the
antibodies. Another use would be in systemic administration to block HIV-1
replication in the blood and tissues. The antibodies could also be
administered in combination with other HIV treatments.

An exemplary pharmaceutical composition is a therapeutically
effective amount of the oligomer, antibody, etc. that, for example, affects
the ability of the receptor to facilitate HIV infection, or of the DNA sequence
or the oligomer that can induce an immune reaction, thereby acting as a
prophylactic immunogen, optionally included in a pharmaceutically-acceptable
and compatible carrier. The term "pharmaceutically-acceptable
and compatible carrier" as used herein, and described more fully below,
includes (i) one or more compatible solid or liquid filler diluents or
encapsulating substances that are suitable for administration to a human
or other animal, and/or (ii) a system, such as a retroviral vector, capable of
delivering the molecule to a target cell. In the present invention, the term
"carrier" thus denotes an organic or inorganic ingredient, natural or
synthetic, with which the molecules of the invention are combined to
facilitate application. The term "therapeutically-effective amount" is that
amount of the present pharmaceutical compositions which produces a
desired result or exerts a desired influence on the particular condition being
treated. An example is the amount necessary to raise an immune reaction to
provide prophylactic protection. Typically, when the composition is being
used as a prophylactic immunogen, at least one "boost" will be administered
at a periodic interval after the initial administration. Various
concentrations may be used in preparing compositions incorporating the
same ingredient to provide for variations in the age of the patient to be
treated, the severity of the condition, the duration of the treatment and the
mode of administration.

The term "compatible", as used herein, means that the components of
the pharmaceutical compositions are capable of being commingled with a
small molecule, nucleic acid and/or polypeptides of the present invention,
and with each other, in a manner that does not substantially impair
the desired pharmaceutical efficacy.

Doses of the pharmaceutical compositions of the invention will vary
depending on the subject and upon the particular route of administration
used. Dosages can range from 0.1 to 100,000 µg/kg per day, more preferably
1 to 10,000 µg/kg. By way of an example only, an overall dose range of from
about, for example, 1 microgram to about 300 micrograms might be used for
human use. This dose can be delivered at periodic intervals based upon the
composition. For example, it can be delivered on at least two separate
occasions, preferably spaced apart by about 4 weeks. Other compounds might
be administered daily. Pharmaceutical compositions of the present invention
can also be administered to a subject according to a variety of other,
well-characterized protocols. For example, certain currently accepted
immunization regimens can include the following: (i) administration times
are a first dose at elected date; a second dose at 1 month after first dose;
and a third dose at 5 months after second dose. See Product Information,
Physician's Desk Reference, Merck Sharp & Dohme (1990), at 1442-43 (e.g.,
Hepatitis B Vaccine-type protocol); (ii) recommended administration for
children is first dose at elected date (at age 6 weeks old or older); a
second dose at 4-8 weeks after first dose; a third dose at 4-8 weeks after
second dose; a fourth dose at 6-12 months after third dose; a fifth dose at
age 4-6 years old; and additional boosters every 10 years after last dose.
See Product Information, Physician's Desk Reference, Merck Sharp & Dohme
(1990), at 879 (e.g., Diphtheria, Tetanus and Pertussis-type vaccine
protocols). Desired time intervals for delivery of multiple doses of a
particular composition can be determined by one of ordinary skill in the art
employing no more than routine experimentation.
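
Regimen (i) above (a first dose at an elected date, a second dose 1 month later and a third dose 5 months after the second) can be turned into calendar dates. The sketch below is illustrative only: the function names are hypothetical, and clamping to the last day of a shorter month is an assumption about how "1 month after" is counted.

```python
import calendar
from datetime import date


def add_months(d, months):
    """Shift a date by whole calendar months, clamping the day to the
    last day of the target month (e.g. Jan 31 + 1 month -> Feb 28 or 29)."""
    month_index = d.month - 1 + months
    year = d.year + month_index // 12
    month = month_index % 12 + 1
    day = min(d.day, calendar.monthrange(year, month)[1])
    return date(year, month, day)


def three_dose_schedule(first_dose):
    """Dates for regimen (i): elected date, +1 month, then +5 months."""
    second = add_months(first_dose, 1)
    third = add_months(second, 5)
    return [first_dose, second, third]


# Hypothetical elected date.
for dose_date in three_dose_schedule(date(2024, 1, 31)):
    print(dose_date.isoformat())
```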



The antibodies, DNA sequences or oligomers of the invention may
also be administered per se (neat) or in the form of a pharmaceutically
acceptable salt. When used in medicine, the salts should be
pharmaceutically acceptable, but non-pharmaceutically acceptable salts
may conveniently be used to prepare pharmaceutically acceptable salts
thereof and are not excluded from the scope of this invention. Such
pharmaceutically acceptable salts include, but are not limited to, those
prepared from the following acids: hydrochloric, hydrobromic, sulfuric,
nitric, phosphoric, maleic, acetic, salicylic, p-toluenesulfonic, tartaric,
citric, methanesulphonic, formic, malonic, succinic, naphthalene-2-sulfonic,
and benzenesulphonic. Also, pharmaceutically acceptable salts can be
prepared as alkali metal or alkaline earth salts, such as sodium,
potassium or calcium salts of the carboxylic acid group. Thus, the present
invention also provides pharmaceutical compositions, for medical use,
which comprise nucleic acid and/or polypeptides of the invention together
with one or more pharmaceutically acceptable carriers thereof and
optionally any other therapeutic ingredients.

The compositions include those suitable for oral, rectal, intravaginal,
topical, nasal, ophthalmic or parenteral administration, all of which may be
used as routes of administration using the materials of the present
invention. Other suitable routes of administration include intrathecal
administration directly into spinal fluid (CSF), direct injection onto an
arterial surface and intraparenchymal injection directly into targeted areas
of an organ. Compositions suitable for parenteral administration are
preferred. The term "parenteral" includes subcutaneous injections,
intravenous, intramuscular, intrasternal injection or infusion techniques.

The compositions may conveniently be presented in unit dosage form
and may be prepared by any of the methods well known in the art of
pharmacy. Methods typically include the step of bringing the active
ingredients of the invention into association with a carrier which constitutes
one or more accessory ingredients.

Compositions of the present invention suitable for oral administration
may be presented as discrete units such as capsules, cachets, tablets or
lozenges, each containing a predetermined amount of the nucleic acid
and/or polypeptide of the invention in liposomes or as a suspension in an
aqueous liquor or non-aqueous liquid such as a syrup, an elixir, or an
emulsion.

Preferred compositions suitable for parenteral administration
conveniently comprise a sterile aqueous preparation of the molecule of the
invention which is preferably isotonic with the blood of the recipient. This
aqueous preparation may be formulated according to known methods using
those suitable dispersing or wetting agents and suspending agents. The
sterile injectable preparation may also be a sterile injectable solution or
suspension in a non-toxic parenterally-acceptable diluent or solvent, for
example as a solution in 1,3-butane diol. Among the acceptable vehicles
and solvents that may be employed are water, Ringer's solution and isotonic
sodium chloride solution. In addition, sterile, fixed oils are conventionally
employed as a solvent or suspending medium. For this purpose any bland
fixed oil may be employed including synthetic mono- or diglycerides. In
addition, fatty acids such as oleic acid find use in the preparation of
injectables.

The term "antibodies" is meant to include monoclonal antibodies,
polyclonal antibodies and antibodies prepared by recombinant nucleic acid
techniques that are selectively reactive with polypeptides encoded by
eukaryotic nucleotide sequences of the present invention. The term
"selectively reactive" refers to those antibodies that react with one or more
antigenic determinants on e.g. gp120 and do not react with other
polypeptides. Antigenic determinants usually consist of chemically active
surface groupings of molecules such as amino acids or sugar side chains
and have specific three-dimensional structural characteristics as well as
specific charge characteristics. Antibodies can be used for diagnostic
applications or for research purposes, as well as to block binding
interactions.

For example, a cDNA clone encoding a gp120 of the present invention
may be expressed in a host using standard techniques (see above; see
Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Press, Cold Spring Harbor, New York: 1989) such that 5-20% of the
total protein that can be recovered from the host is the desired protein.
Recovered proteins can be electrophoresed using PAGE and the appropriate
protein band can be cut out of the gel. The desired protein sample can then
be eluted from the gel slice and prepared for immunization. Preferably, one
would design a stable cell capable of expressing high levels of the proteins,
which would be selected and used to generate antibodies.

For example, mice can be immunized twice intraperitoneally with
approximately 50 micrograms of protein immunogen per mouse. Sera from
such immunized mice can be tested for antibody activity by
immunohistology or immunocytology on any host system expressing such
polypeptide and by ELISA with the expressed polypeptide. For
immunohistology, active antibodies of the present invention can be
identified using a biotin-conjugated anti-mouse immunoglobulin followed by
avidin-peroxidase and a chromogenic peroxidase substrate. Preparations of
such reagents are commercially available; for example, from Zymad Corp.,
San Francisco, California. Mice whose sera contain detectable active
antibodies according to the invention can be sacrificed three days later and
their spleens removed for fusion and hybridoma production. Positive
supernatants of such hybridomas can be identified using the assays
described above and by, for example, Western blot analysis.

To further improve the likelihood of producing an antibody as
provided by the invention, the amino acid sequence of polypeptides encoded
by a eukaryotic nucleotide sequence of the present invention may be
analyzed in order to identify desired portions of amino acid sequence which
may be associated with receptor binding. For example, polypeptide
sequences may be subjected to computer analysis to identify such sites.

For preparation of monoclonal antibodies directed toward
polypeptides encoded by a eukaryotic nucleotide sequence of the invention,
any technique that provides for the production of antibody molecules by
continuous cell lines may be used. For example, the hybridoma technique
originally developed by Kohler and Milstein (Nature, 256: 495-497, 1975), as
well as the trioma technique, the human B-cell hybridoma technique
(Kozbor et al., Immunology Today, 4:72), and the EBV-hybridoma technique
to produce human monoclonal antibodies, and the like, are within the scope
of the present invention. See, generally, Larrick et al., U.S. Patent 5,001,065
and references cited therein. Further, single-chain antibody (SCA) methods
are also available to produce antibodies against polypeptides encoded by a
eukaryotic nucleotide sequence of the invention (Ladner et al., U.S. patents
4,704,694 and 4,976,778).

The monoclonal antibodies may be human monoclonal antibodies or
chimeric human-mouse (or other species) monoclonal antibodies. The
present invention provides for antibody molecules as well as fragments of
such antibody molecules.

Those of ordinary skill in the art will recognize that a large variety of
possible moieties can be coupled to the resultant antibodies or to other
molecules of the invention. See, for example, "Conjugate Vaccines",
Contributions to Microbiology and Immunology, J.M. Cruse and R.E. Lewis,
Jr (eds), Carger Press, New York, (1989), the entire contents of which are
incorporated herein by reference.

Coupling may be accomplished by any chemical reaction that will
bind the two molecules so long as the antibody and the other moiety retain
their respective activities. This linkage can include many chemical
mechanisms, for instance covalent binding, affinity binding, intercalation,
coordinate binding and complexation. The preferred binding is, however,
covalent binding. Covalent binding can be achieved either by direct
condensation of existing side chains or by the incorporation of external
bridging molecules. Many bivalent or polyvalent linking agents are useful in
coupling protein molecules, such as the antibodies of the present invention,
to other molecules. For example, representative coupling agents can
include organic compounds such as thioesters, carbodiimides, succinimide
esters, diisocyanates, glutaraldehydes, diazobenzenes and hexamethylene
diamines. This listing is not intended to be exhaustive of the various
classes of coupling agents known in the art but, rather, is exemplary of the
more common coupling agents. (See Killen and Lindstrom 1984, "Specific
killing of lymphocytes that cause experimental Autoimmune Myasthenia
Gravis by toxin-acetylcholine receptor conjugates." Jour. Immun.
133:1335-2549; Jansen, F.K., H.E. Blythman, D. Carriere, P. Casella, O.
Gros, P. Gros, J.C. Laurent, F. Paolucci, B. Pau, P. Poncelet, G. Richer, H.
Vidal, and G.A. Voisin. 1982. "Immunotoxins: Hybrid molecules combining
high specificity and potent cytotoxicity". Immunological Reviews 62:185-216;
and Vitetta et al., supra).

Preferred linkers are described in the literature. See, for example,
Ramakrishnan, S. et al., Cancer Res. 44:201-208 (1984), describing use of
MBS (m-maleimidobenzoyl-N-hydroxysuccinimide ester). See also
Umemoto et al., U.S. Patent 5,030,719, describing use of a halogenated acetyl
hydrazide derivative coupled to an antibody by way of an oligopeptide linker.
Particularly preferred linkers include: (i) EDC
(1-ethyl-3-(3-dimethylamino-propyl) carbodiimide hydrochloride); (ii) SMPT
(4-succinimidyloxycarbonyl-alpha-methyl-alpha-(2-pyridyl-dithio)-toluene)
(Pierce Chem. Co., Cat. #21558G); (iii) SPDP (succinimidyl-6
[3-(2-pyridyldithio) propionamido] hexanoate) (Pierce Chem. Co., Cat.
#21651G); (iv) Sulfo-LC-SPDP (sulfosuccinimidyl 6
[3-(2-pyridyldithio)-propionamido] hexanoate) (Pierce Chem. Co., Cat.
#2165-G); and (v) sulfo-NHS (N-hydroxysulfo-succinimide; Pierce Chem. Co.,
Cat. #24510) conjugated to EDC.

The linkers described above contain components that have different
attributes, thus leading to conjugates with differing physio-chemical
properties. For example, sulfo-NHS esters of alkyl carboxylates are more
stable than sulfo-NHS esters of aromatic carboxylates. NHS-ester
containing linkers are less soluble than sulfo-NHS esters. Further, the
linker SMPT contains a sterically hindered disulfide bond, and can form
conjugates with increased stability. Disulfide linkages are, in general, less
stable than other linkages because the disulfide linkage is cleaved in vitro,
resulting in less conjugate available. Sulfo-NHS, in particular, can enhance
the stability of carbodiimide couplings. Carbodiimide couplings (such as
EDC), when used in conjunction with sulfo-NHS, form esters that are more
resistant to hydrolysis than the carbodiimide coupling reaction alone.

Antibodies of the present invention can be detected by appropriate
assays, such as the direct binding assay discussed earlier and by other
conventional types of immunoassays. For example, a sandwich assay can
be performed in which the receptor or fragment thereof is affixed to a solid
phase. Incubation is maintained for a sufficient period of time to allow the
antibody in the sample to bind to the immobilized polypeptide on the solid
phase. After this first incubation, the solid phase is separated from the
sample. The solid phase is washed to remove unbound materials and
interfering substances such as non-specific proteins which may also be
present in the sample. The solid phase containing the antibody of interest
bound to the immobilized polypeptide of the present invention is
subsequently incubated with labeled antibody or antibody bound to a
coupling agent such as biotin or avidin. Labels for antibodies are well
known in the art and include radionuclides, enzymes (e.g., malate
dehydrogenase, horseradish peroxidase, glucose oxidase, catalase), fluors
(fluorescein isothiocyanate, rhodamine, phycocyanin, fluorescamine), biotin,
and the like. The labeled antibodies are incubated with the solid phase, and
the label bound to the solid phase is measured, the amount of the label detected
serving as a measure of the amount of anti-urea transporter antibody
present in the sample. These and other immunoassays can be easily
performed by those of ordinary skill in the art. 

The following Examples serve to illustrate the present invention, and
are not intended to limit the invention in any manner.

Specific groups of HIV-1 neutralizing antibodies directed against the
gp120 V3 loop or CD4-induced (CD4i) epitopes were able to block the
binding of gp120-sCD4 complexes to CCR5-expressing cells (3,4). The CD4i
epitopes are conserved, discontinuous gp120 structures that are exposed
better after CD4 binding (5). Mutagenic analysis suggested that elements of
the conserved stem of the V1/V2 stem-loop and of the fourth conserved
region of gp120 comprise the CD4i epitopes (5). The following examples
demonstrate that conserved gp120 residues near or within the CD4i
epitopes are critical for CCR5 binding.

An assay was established that could assess the CCR5-binding ability
of a panel of HIV-1 gp120 glycoprotein mutants. The mutants were created
by the introduction of single amino acid changes in gp120 residues near or
within regions previously shown to be important for the integrity of the CD4i
epitopes (5). The wtA glycoprotein, which lacks the V1/V2 variable loops
and the N-terminus and is derived from the YU2 primary macrophage-tropic
HIV-1 isolate (7), was the starting point for the studies (Fig. 1A-E). This
protein was chosen because it had been shown to bind CD4 and CCR5 with
high affinity (3,8,9). Furthermore, the use of this protein minimized the
opportunities for indirect effects of gp120 amino acid changes on CCR5
binding (e.g., by repositioning the V1/V2 loops, which can mask CD4i
epitopes (9)). Metabolically labeled wtA and mutant derivatives were
produced in 293T cells and incubated with mouse L1.2 cells stably
expressing human CCR5 (3), in either the absence or presence of sCD4. The
cells were washed and lysed, and bound gp120 protein was detected by
precipitation with a mixture of sera from HIV-1-infected individuals (10).

The wtA protein efficiently bound to the L1.2-CCR5 cells in the
presence of sCD4. Binding was dramatically reduced when sCD4 was not
present in the assay. The wtA protein binding to the L1.2-CCR5 cells was
inhibited by preincubation of the wtA protein with the 17b antibody.
Binding was also inhibited by incubation of the L1.2-CCR5 cells with the
2D7 antibody against CCR5 (11) or with the CCR5 ligand, MIP-1β (12).
The C11 antibody, which is directed against a gp120 region dispensable for
CCR5 binding (3), did not block the binding of the wtA protein to the
L1.2-CCR5 cells (data not shown). The wtA protein did not bind appreciably to
the parental L1.2 cells not expressing CCR5 even in the presence of sCD4.
These results indicate that the wtA protein binds CCR5 in a specific,
CD4-dependent manner.

The binding of the panel of gp120 mutants to the L1.2-CCR5 cells in
the absence and presence of sCD4 was measured. The recognition of the
mutant proteins by sCD4 and by monoclonal antibodies that recognize
discontinuous gp120 epitopes (5,13) was assessed in parallel (10). Changes
in several gp120 amino acids resulted in dramatic reductions in the ability
of the protein to bind to L1.2-CCR5 cells in the presence of sCD4 (Table 1).
In some cases (257 T/D, 370 E/Q and 383 F/S), the attenuated CD4-binding
ability of the mutant proteins could account for the observed
reduction in binding to the L1.2-CCR5 cells. In most cases, however, the
mutant proteins that were deficient in CCR5 binding still bound sCD4 and
at least one of the monoclonal antibodies recognizing discontinuous gp120
epitopes. As expected, some of the introduced amino acid changes
decreased recognition by the 17b antibody. Interestingly, two of the gp120
amino acid changes (437 P/A, 442 Q/L) resulted in an increase in CCR5
binding compared with the wtA protein, even though CD4 binding was not
significantly increased. In the absence of sCD4, the 437 P/A and 442 Q/L
envelope glycoprotein mutants bound to the L1.2-CCR5 cells slightly better
than the other mutants and the wtA protein, which exhibited very low levels
of binding (data not shown).
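
The reasoning in the preceding paragraph can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the two cutoffs are hypothetical values chosen for the sketch and are not given in the text, and the function name classify is likewise invented here. The binding values are a subset of those listed in Table 1.

```python
# Illustrative triage of mutant phenotypes; not part of the described assay.
# Both cutoffs are hypothetical assumptions made for this sketch.
CCR5_DEFICIENT_BELOW = 0.25  # assumed cutoff for "deficient in CCR5 binding"
CD4_ATTENUATED_BELOW = 0.25  # assumed cutoff for "attenuated CD4 binding"

# mutant -> (CCR5 binding, sCD4 binding), relative to wtA = 1.00 (Table 1)
TABLE_1_SUBSET = {
    "121 K/D": (0.07, 0.73),
    "257 T/D": (0.05, 0.0),
    "370 E/Q": (0.17, 0.0),
    "383 F/S": (0.04, 0.0),
    "420 I/R": (0.06, 0.59),
    "437 P/A": (1.79, 0.80),
}


def classify(ccr5, scd4):
    """Label a mutant by whether weak CD4 binding could explain weak CCR5 binding."""
    if ccr5 >= CCR5_DEFICIENT_BELOW:
        return "CCR5 binding retained"
    if scd4 < CD4_ATTENUATED_BELOW:
        return "reduction attributable to weak CD4 binding"
    return "CCR5 binding specifically reduced"


for mutant, (ccr5, scd4) in TABLE_1_SUBSET.items():
    print(f"{mutant}: {classify(ccr5, scd4)}")
```

With these assumed cutoffs, 257 T/D, 370 E/Q and 383 F/S fall into the group whose CCR5-binding defect could be explained by weak CD4 binding, in agreement with the text.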

Table 1. Phenotypes of HIV-1 gp120 mutants. The ability of the wtA
and mutant glycoproteins to bind CCR5 expressed on L1.2 cells was
determined (10). The recognition of the wtA and mutant glycoproteins by
sCD4 and monoclonal antibodies was determined (10). All values reported
are relative to those seen for the wtA protein. Values represent the average
of at least two independent experiments and exhibited less than 30%
variation from the value shown.
Protein                                     Ligand Binding
(Fractional Solvent     CCR5
Accessibility)*         Binding†  sCD4    17b     CG10    F105

wtA                     1.00      1.00    1.00    1.00    1.00
107 D/R                 1.02      1.02    0.97    1.11    1.14
114 Q/L                 1.22      0.79    0.73    0.71    0.75
117 K/D (0.45)          0.15      0.74    0.64    0.42    0.83
121 K/D (0.57)          0.07      0.73    0.11    0.0     0.99
122 L/S                 0.98      0.84    1.07    0.18    1.11
123 T/D (0.49)          0.08      0.99    1.06    0.0     1.25
197 N/D                 1.33      1.34    0.80    0.81    1.11
199 S/L                 1.50      1.32    0.94    1.03    1.04
200 V/S                 0.84      0.91    1.05    0.49    1.06
201 I/A                 0.46      0.90    0.67    0.84    0.81
203 Q/L                 0.68      0.85    0.88    0.52    0.93
207 K/D (0.23)          0.0       0.85    0.46    0.13    0.98
209 S/L                 1.00      1.11    0.85    1.01    1.00
210 F/S                 0.65      0.81    0.81    0.85    0.74
211 E/K                 0.73      1.13    1.03    1.12    1.24
257 T/D                 0.05      0.0     0.49    0.06    0.0
295 N/E                 0.86      0.75    0.73    0.98    0.79
308 N/D                 0.31      1.10    0.89    0.93    1.03
317 L/S                 0.08      1.12    1.05    1.13    1.03
330 H/A                 0.22      0.75    0.55    0.66    0.64
ΔV3 (Δ298-329)          0.0       0.80    0.08    1.27    0.93
370 E/Q                 0.17      0.0     1.04    0.12    0.0
372 V/S                 0.85      1.03    1.08    1.09    0.44
373 T/D                 0.48      1.12    1.10    1.16    1.10
377 N/E (0.04)          0.22      0.71    0.52    0.65    0.60
381 E/R (0.07)          0.07      0.81    0.75    0.29    0.96
383 F/S                 0.04      0.0     0.0     0.07    0.0
386 N/D                 1.22      1.14    0.97    0.90    0.97
419 R/D (0.82)          0.19      0.86    0.02    0.48    0.82
420 I/R (0.14)          0.06      0.59    0.0     0.72    0.72
421 K/D (0.32)          0.07      0.86    0.19    0.0     0.0
422 Q/L (0.35)          0.07      0.53    0.0     0.20    0.55
423 I/S                 0.61      0.97    0.05    0.30    1.03
424 I/S                 0.37      0.25    0.48    0.83    0.81
426 M/A                 0.75      0.69    0.69    0.72    1.11
429 E/R                 1.54      1.17    1.00    1.05    0.82
432 K/A                 0.61      1.0     0.92    0.0     1.45
434 M/A                 1.22      0.90    0.65    0.07    1.04
435 Y/S                 0.21      0.33    0.22    0.29    1.00
436 A/S                 0.98      1.05    0.91    0.99    1.23
437 P/A                 1.79      0.80    0.68    0.78    0.82
438 P/A (0.28)          0.06      1.18    1.00    1.13    1.18
439 I/A                 0.45      0.68    0.76    0.76    0.84
440 R/D (0.43)          0.09      1.03    1.05    1.05    1.13
441 G/V (0.91)          0.0       0.67    0.70    0.62    0.78
442 Q/L                 2.00      1.11    0.74    1.05    0.83
444 R/D (0.80)          0.25      0.79    0.67    0.94    0.74
474 D/R                 1.03      0.59    0.81    0.74    0.0

*The numbering of the mutant wtA glycoproteins is based on the
sequence of the prototypic HXBc2 gp120 glycoprotein (24), with 1
representing the initiator methionine. The wild-type YU2 gp120 residue is
listed, followed by the substituted residue. Amino acid abbreviations: A,
alanine; D, aspartic acid; E, glutamic acid; F, phenylalanine; G, glycine; H,
histidine; I, isoleucine; K, lysine; L, leucine; M, methionine; N, asparagine;
P, proline; Q, glutamine; R, arginine; S, serine; T, threonine; V, valine; Y,
tyrosine. The fractional solvent accessibilities associated with gp120
residues in which changes specifically disrupted CCR5 binding are shown in
parentheses. Fractional solvent accessibility was calculated as the ratio of
solvent-accessible surface area for atoms of amino-acid residue X in the
gp120 core (without carbohydrate moieties) to the area obtained after
reducing the structure to a Gly-X-Gly tripeptide (24). Values cited are for
side-chain atoms except for glycine 441, where the value for all atoms is
given.
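
The ratio defined in this footnote can be written as a short Python sketch. The function name and the example areas are hypothetical, chosen only to illustrate the calculation; the surface areas themselves would come from a separate accessibility program.

```python
def fractional_solvent_accessibility(area_in_core, area_in_gly_x_gly):
    """Ratio of a residue's solvent-accessible area in the gp120 core to its
    area in the Gly-X-Gly reference tripeptide (both in the same units)."""
    if area_in_gly_x_gly <= 0:
        raise ValueError("reference area must be positive")
    return area_in_core / area_in_gly_x_gly


# Hypothetical side-chain areas in square angstroms, for illustration only.
print(round(fractional_solvent_accessibility(45.0, 100.0), 2))
```

A value near 0 indicates a buried residue, and a value near 1 indicates a residue about as exposed as in the isolated tripeptide.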

†The binding of the wtA glycoprotein to L1.2-CCR5 cells was shown
to be linearly related to the concentration of wtA protein in the transfected
293T cell supernatants, over the range of concentrations used in these
experiments. The total amount of wtA and mutant glycoprotein present in
the 293T cell supernatants was estimated by precipitation with an excess of
a mixture of sera from HIV-1-infected individuals. The amount of wtA and
mutant glycoprotein bound to the L1.2-CCR5 cells was determined as
described (10). The value for CCR5 binding was calculated using the
following formula:
\[
\text{CCR5 binding} = \frac{\text{Bound mutant protein}}{\text{Bound wtA protein}} \times \frac{\text{Total wtA protein}}{\text{Total mutant protein}}
\]

The recognition of the wtA and mutant glycoproteins by sCD4 and
antibodies was determined by precipitation of radiolabeled envelope
glycoproteins in transfected 293T cell supernatants as described (10). In
parallel, the labeled envelope glycoproteins were precipitated with an excess
of a mixture of sera from HIV-1-infected individuals. The value for ligand
binding was calculated using the following formula:

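\[
\text{Ligand binding} = \frac{\text{Mutant protein}_{\text{ligand}}}{\text{wtA protein}_{\text{ligand}}} \times \frac{\text{wtA protein}_{\text{serum mixture}}}{\text{Mutant protein}_{\text{serum mixture}}}
\]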
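
The two normalizations used for Table 1 can be sketched in Python as follows. The function and parameter names are hypothetical, and the example signal values are invented purely to show the arithmetic; they are not measurements from the experiments described.

```python
def ccr5_binding(bound_mutant, bound_wt, total_mutant, total_wt):
    """Cell-bound mutant signal relative to wtA, corrected for the total
    amount of each protein present in the supernatant."""
    return (bound_mutant / bound_wt) * (total_wt / total_mutant)


def ligand_binding(mutant_ligand, wt_ligand, mutant_serum, wt_serum):
    """Mutant signal precipitated by a ligand relative to wtA, corrected
    for the signal precipitated by the excess serum mixture."""
    return (mutant_ligand / wt_ligand) * (wt_serum / mutant_serum)


# Hypothetical band intensities in arbitrary units.
print(ccr5_binding(bound_mutant=20.0, bound_wt=100.0,
                   total_mutant=50.0, total_wt=100.0))
print(ligand_binding(mutant_ligand=30.0, wt_ligand=60.0,
                     mutant_serum=80.0, wt_serum=100.0))
```

In both cases the second factor corrects for differences in expression, so a mutant that is simply produced at a lower level than wtA is not scored as binding-deficient.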



In the sCD4 and 17b columns, the values in bold indicate gp120
residues that exhibit decreased solvent accessibility in the presence of the
two-domain sCD4 or 17b Fab, respectively, in the ternary complex (6).
Changes in solvent accessibility were calculated using the MS program of
Michael Connolly.

Graphics. Molecular graphics were produced using Midas-Plus (University
of California, San Francisco) and GRASP (30).

Assignment of variability. Variability in gp120 residues was assessed using
an alignment of sequences derived from approximately 400 HIV-1, HIV-2
and simian immunodeficiency viruses. ^3 Residues were assigned variability
indices and color coded as follows:
Red: conserved in all primate immunodeficiency viruses;
Orange: conserved in all HIV-1, including groups M and O and
chimpanzee isolates;
Yellow: some variation among HIV-1 isolates (divergence from
the consensus sequence in 1-8 of the 12 HIV-1 groups
examined);
Green: variable among HIV-1 isolates (divergence from the
consensus sequence in ≥9 of the 12 HIV-1 groups
examined).
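
The color assignment above can be expressed as a small Python function. This is a sketch under stated assumptions: the function name and its inputs are hypothetical, and the green category is read as divergence in 9 or more of the 12 groups, which is the complement of the 1-8 range given for yellow.

```python
def variability_color(conserved_in_all_primate_viruses, divergent_hiv1_groups,
                      groups_examined=12):
    """Map a residue's conservation pattern to the color code in the text."""
    if not 0 <= divergent_hiv1_groups <= groups_examined:
        raise ValueError("divergent group count out of range")
    if conserved_in_all_primate_viruses:
        if divergent_hiv1_groups != 0:
            raise ValueError("a fully conserved residue cannot diverge in HIV-1")
        return "red"
    if divergent_hiv1_groups == 0:
        return "orange"   # conserved in all HIV-1 but not in all primate viruses
    if divergent_hiv1_groups <= 8:
        return "yellow"   # divergence in 1-8 of the 12 groups
    return "green"        # divergence in 9 or more groups (assumed reading)


print(variability_color(False, 3))
```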
Molecular modeling. Residues 88, 89, and 397-409, which are disordered
in the ternary complex crystals (H. Deng et al., Nature, 381:661-666 (1996)),
were built manually using the program TOM. For the V4 loop (residues
397-409), a dominant constraint was the distance between the ordered residues
396 and 410 (C-C distance of 26.88 Å). For the carbohydrate,
examination of the N-linked carbohydrate in several crystal structures (e.g.,
1fc2, 1gly, 1lte) showed that the core common to both high-mannose and
complex N-linked sugars, (NAG)2(MAN)3, did not differ greatly in
conformation after alignment of the first NAG. This core, which represents
roughly half the total glycosylation for a typical N-linked site, was built onto
each of the 18 consensus N-linked glycosylation sites found on the HXBc2
gp120 core. The stereochemistry of this initial model was refined using
simulated annealing in XPLOR. Briefly, the model was heated to between
2,500°K and 3,500°K, and "slow cooled" in steps of 25°K to 300°K. At each
step, molecular dynamics were performed with the core gp120 fixed,
allowing only the modeled residues and carbohydrate (including any
attached Asn) to move. In three separate runs, performing molecular
dynamics for 5 fs/step, all steric clashes could be removed and the geometry
idealized, with an average root mean square (RMS) of carbohydrate
movement of only ~3.5 Å. Four subsequent runs were made using dynamics
times of 50-75 fs/step. The carbohydrate positions obtained from
these runs differed more substantially from those in the starting model
(average carbohydrate RMS difference of roughly 8 Å). Two of the models
from these longer annealings were much more similar to each other than to
the rest (RMS differences in carbohydrate of ~4 Å versus ~8 Å for all other
models). One had been heated to 3,500°K with dynamics of 75 fs/step. The
other (shown in the figures here) was heated to only 2,500°K with dynamics
of 50 fs/step. In general, the RMS movement of the NAG sugars was roughly
half the RMS movement of the MAN sugars, reflecting greater
conformational flexibility further from the protein surface.
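
The cooling schedule and the RMS comparison described above can be illustrated with a short Python sketch. It is not the XPLOR protocol itself: the function names are hypothetical, no molecular dynamics is performed, and the RMS is computed directly on matched coordinates without superposition, on the assumption that the fixed gp120 core already places all models in a common frame.

```python
import math


def slow_cool_schedule(start_k, end_k=300.0, step_k=25.0):
    """Temperatures visited when cooling from start_k to end_k in fixed steps."""
    if step_k <= 0 or start_k < end_k:
        raise ValueError("need step_k > 0 and start_k >= end_k")
    temps = []
    t = float(start_k)
    while t > end_k:
        temps.append(t)
        t -= step_k
    temps.append(float(end_k))
    return temps


def rms_displacement(coords_a, coords_b):
    """Root mean square distance between matched atoms of two models."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = sum((xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
                for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b))
    return math.sqrt(total / len(coords_a))


schedule = slow_cool_schedule(2500.0)
print(len(schedule), schedule[0], schedule[-1])

# Two hypothetical two-atom models, coordinates in angstroms.
print(rms_displacement([(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
                       [(3.0, 4.0, 0.0), (1.0, 0.0, 0.0)]))
```

Starting from 2,500°K, a 25° decrement to 300°K gives 89 temperature steps; starting from 3,500°K gives 129.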

In primary sequence, human and simian immunodeficiency virus
gp120 glycoproteins consist of five variable regions (V1-V5) interposed
among more conserved regions (G. Alkhatib et al., Science 272:1955-1958
(1996)). Variable regions V1-V4 form exposed loops anchored at their bases
by disulfide bonds (L. Wu et al., Nature, 384:179-183 (1996)).
Neutralizing antibodies recognize both variable and conserved gp120
structures. The V2 and V3 loops contain epitopes for strain-restricted
neutralizing antibodies (E. Emini et al., Nature, 355:728-730 (1992); S.
Putney et al., Science, 234:1392-1395 (1986); and C. Bruck et al., Colloque
des Cent Gardes, 227-233 (1990)). More broadly neutralizing antibodies
recognize discontinuous, conserved epitopes in three regions of the gp120
glycoprotein (Table 2). In HIV-1-infected humans, the most abundant of
these are directed against the CD4 binding site (CD4BS) and block the
gp120-CD4 interaction (Y. Feng et al., Science 272:872-877 (1996); and H. Choe et
al., Cell, 85:1135-1148 (1996)). Less common are antibodies against
epitopes induced or exposed upon CD4 binding (CD4i) (P. Berman et al.,
Nature, 345:622-625 (1990)). Both CD4i and V3 antibodies disrupt the
binding of gp120-CD4 complexes to chemokine receptors (B. Doranz et al.,
Cell, 85:1149-1158 (1996); and T. Dragic et al., Nature, 381:667-673
(1996)). A third gp120 neutralization epitope is defined by a unique
monoclonal antibody, 2G12 (W. Robey et al., Proc. Natl. Acad. Sci. U.S.A.,
83:7023-7027 (1986)), which does not efficiently block receptor binding (T.
Dragic et al., Nature, 381:667-673 (1996)).

The X-ray crystal structure of an HIV-1 gp120 core in a ternary
complex with two-domain soluble CD4 and the Fab fragment of the CD4i
antibody, 17b, has been determined. The gp120 core lacks the V1/V2 and V3
variable loops, as well as N- and C-terminal sequences, which interact with
the gp41 glycoprotein (M. Lu et al., Nature Structural Biol., 2:1075-1082
(1995)), and is enzymatically deglycosylated (H. Deng et al., Nature,
381:661-666 (1996); and K. Steimer et al., Science, 254:105-108 (1991)).
Despite these modifications, the gp120 core binds CD4 and antibodies against
CD4BS and CD4i epitopes (K. Steimer et al., Science, 254:105-108 (1991); and
M. Posner et al., J. Immunol., 146:4325-4332 (1991)) and thus retains
structural integrity. The gp120 core is composed of an inner domain, an outer
domain and a third element, the "bridging sheet" (H. Deng et al., Nature,
381:661-666 (1996)) (Figure 1A). All three structural elements contribute,
either directly or indirectly, to CD4 and chemokine receptor binding. ^2

Although generally well-conserved compared with the five variable
regions, some variability in the surface of the gp120 core is evident when
the sequences of all primate immunodeficiency viruses are analyzed. This
variability is disproportionately associated with the surface of the outer
domain proximal to the V4 and V5 regions and removed from the
receptor-binding regions (Figures 1C and 2). The A, C, D and E surface loops (12)
contribute to the variability of this surface. The potential N-linked
glycosylation sites present in the gp120 core are concentrated in this
variable half of the protein. In fact, the only conserved residues apparent on
this relatively variable surface are asparagine 356 and threonine/serine
358, which constitute a complex carbohydrate addition site within the E
loop. Since most carbohydrate moieties may appear as "self" to the immune
system, the extensive glycosylation of the outer domain surface should
render it less visible to immune surveillance. This helps to explain why
antibodies directed against this gp120 surface have been identified so
infrequently.

The receptor-binding regions retained in the gp120 core are
well-conserved among primate immunodeficiency viruses (H. Deng et al., Nature,
381:661-666 (1996)). Also highly conserved is the surface of the inner
domain spanned by the pi helix and located opposite the variable surface
described above. This surface is likely to interact with gp41 and/or with
N-terminal gp120 segments absent from the gp120 core. This inner domain
surface and the receptor-binding regions are devoid of glycosylation.

In conjunction with prior mutagenic and antibody competition analyses (A.
Pinter et al., J. Virol., 63:2674-2679 (1989); M. Lu et al., Nature Structural
Biol., 2:1075-1082 (1995); P. Berman et al., Nature, 345:622-625 (1990); W.
Robey et al., Proc. Natl. Acad. Sci. U.S.A., 83:7023-7027 (1986); J. Rusche et
al., Proc. Natl. Acad. Sci. U.S.A., 85:3198-3202 (1988); and K. Steimer et al.,
Science, 254:105-108 (1991)), the gp120 core structure reveals for the first
time the spatial positioning of the conserved gp120 neutralization epitopes.
Although the major variable loops are either absent (V1/V2 and V3) or
poorly resolved (V4) in the gp120 core structure, their approximate positions
can be deduced (Figure 3A). The conserved gp120 neutralization epitopes
are discussed in relation to these variable loops and to the variable,
glycosylated core surface.
a) CD4i epitopes. The gp120 epitope recognized by the CD4i
antibody, 17b, can be directly visualized in the crystallized ternary complex
(Figures 3B and 3C). Strands from the gp120 fourth conserved (C4) region
and the V1/V2 stem contribute to an antiparallel β-sheet (the "bridging
sheet" (see Figure 1A)) that contacts the antibody. The vast majority of
gp120 residues previously implicated in formation of the CD4i epitopes^^
(Table 2) are located either within this β-sheet or in nearby structures. With
the exception of Thr 202 and Met 434, the gp120 residues in contact with
the 17b Fab are highly conserved among HIV-1 isolates (Figures 1C, 2 and
3A). The prominent ("male") CDR3 loop of the 17b heavy chain dominates
the contacts with gp120, with additional contacts through the heavy chain
CDR2 (H. Deng et al., Nature, 381:661-666 (1996)). Unusually, there are
minimal 17b light chain contacts, leaving a large gap between the gp120
core and most of the 17b light chain surface. In the complete gp120
glycoprotein, this gap is likely occupied by the V3 loop. This is consistent
with the position and orientation of the V3 stem on the gp120 core structure
(H. Deng et al., Nature, 381:661-666 (1996)), the effect of V3 deletions on the
binding of CD4i antibodies in the absence of soluble CD4 (M. Posner et al.,
J. Immunol., 146:4325-4332 (1991)), the competition of some V3-directed
antibodies with CD4i antibodies (A. Pinter et al., J. Virol., 63:2674-2679
(1989)), and the ability of both antibody groups to block chemokine receptor
binding (B. Doranz et al., Cell, 85:1149-1158 (1996); and T. Dragic et al.,
Nature, 381:667-673 (1996)). The chemokine receptor-binding region of
gp120 likely consists of elements near or within the "bridging sheet" and the
V3 loop.

The V2 loop likely resides on the side of the 17b epitope opposite the
V3 loop (Figure 3A). The V1/V2 loops, which vary from 57 to 86 residues in
length (13), are dispensable for HIV-1 replication (M. Posner et al., J. Immunol.,
146:4325-4332 (1991); and R. Wyatt et al., J. Virol., 69:5723-5733 (1995))
but decrease the sensitivity of viruses to neutralization by antibodies
against V3 and CD4i epitopes (R. Wyatt et al., J. Virol., 69:5723-5733
(1995)). The latter effect is mediated primarily by the V2 loop (M. Posner et
al., J. Immunol., 146:4325-4332 (1991)), suggesting that part of the V2 loop
folds back along the V1/V2 stem to mask the "bridging sheet" and adjacent
V3 loop. The proximity of the V2 and V3 loops is supported by the
observation that, in monkeys infected with simian-human
immunodeficiency viruses (SHIVs), neutralizing antibodies are raised
against discontinuous epitopes with V2 and V3 components (B.
Etemad-Moghadam and J. Sodroski). The CD4i epitopes are apparently masked by
the flanking V2 and V3 loops, requiring the evolution of antibodies with
protruding ("male") CDRs to access these conserved epitopes. CD4 binding
has been suggested to reposition the V1/V2 loops, thus exposing the CD4i
epitopes (M. Posner et al., J. Immunol., 146:4325-4332 (1991)). The
presence of contacts between the V1/V2 stem and CD4 in the crystal
structure is consistent with this model.

b) CD4BS epitopes. CD4 makes a number of contacts within a
recessed pocket on the gp120 surface. The gp120-CD4 interface includes
two cavities, one water-filled and bounded equally by both proteins, the
other extending into the gp120 interior and contacting CD4 only at
phenylalanine 43 (H. Deng et al., Nature, 381:661-666 (1996)). Tables 1, 2
and Figures 3B and 3C show the gp120 residues implicated in the formation
of CD4BS epitopes recognized by eight representative antibodies. CD4BS
epitopes are uniformly disrupted by changes in Asp 368 and Glu 370 (J.
Rusche et al., Proc. Natl. Acad. Sci. U.S.A., 85:3198-3202 (1988)), which
surround the opening of the "Phe 43 cavity". These residues are located on
a ridge at the intersection of the two receptor-binding gp120 surfaces,
consistent with competition studies suggesting that CD4BS epitopes overlap
both the CD4i epitopes and the binding site for CD4 (A. Pinter et al., J.
Virol., 63:2674-2679 (1989); and P. Berman et al., Nature, 345:622-625
(1990)). The location of the gp120 residues implicated in the formation of
the CD4BS epitopes suggests that important elements of the CD4-binding
surface of gp120 are accessible to antibodies.
Some CD4BS antibodies, like IgG1b12, are particularly potent at
neutralizing HIV-1 (J. Robinson et al., AIDS Res. Hum. Retro., 6:567-580
(1990)). IgG1b12 binding is disrupted by gp120 changes that affect the
binding of other CD4BS antibodies but, atypically, is sensitive to changes in
the V1/V2 stem-loop structure. The observation that some well-conserved
residues in the gp120 V1/V2 stem contact CD4 (H. Deng et al., Nature,
381:661-666 (1996)) raises the possibility that this protruding structure
also contributes to the IgG1b12 epitope. This might increase the ability of
the antibody to access the assembled envelope glycoprotein trimer, thus
increasing neutralizing capability.

While the CD4BS epitopes and the CD4-binding site overlap, several
observations demonstrate that the binding of CD4BS antibodies differs from
that of CD4. Changes in Trp 427, a gp120 residue that contacts both the
"Phe 43 cavity" and CD4, uniformly disrupt CD4 binding but affect the
binding of only some CD4BS antibodies (Table 2). Conversely, some
changes in other cavity-lining gp120 residues, Ser 256 and Thr 257, affect
the binding of CD4BS antibodies more than the binding of CD4 (J. Rusche
et al., Proc. Natl. Acad. Sci. U.S.A., 85:3198-3202 (1988)). Since the recessed
position of Ser 256 and Thr 257 in the current crystal structure (Figures 3B
and 3C) makes direct contacts with antibody unlikely, either the effects of
changes in these residues are indirect or the CD4BS antibodies recognize a
gp120 conformation that differs from the CD4-bound state. With respect to
the latter possibility, several of the residues implicated in the integrity of the
CD4BS epitopes are located in the interface between the inner and outer
gp120 domains. CD4BS antibodies might recognize a gp120 conformation
in which the spatial relationship between the domains is altered compared
with the CD4-bound state, thus allowing better surface exposure of these
residues. Differences between the CD4BS epitopes and the CD4-binding
site create opportunities for neutralization escape (J. Rusche et al., Proc.
Natl. Acad. Sci. U.S.A., 85:3198-3202 (1988)). The gp120 residues
surrounding the "Phe 43 cavity" are highly conserved among primate
immunodeficiency viruses (Figure 3A), but the observed modest variation in
adjacent surface-accessible residues (e.g., Pro 369, Thr 373 and Lys 432)
could account for decreased recognition of the gp120 glycoprotein from
some geographic clades of HIV-1 by CD4BS antibodies (S. Tilley et al., Res.
Virol., 142:247-259 (1991)). Additional potential for variation near or within
the CD4BS epitopes is created by the unusual water-filled cavity in the
gp120-CD4 binding interface, since CD4 binding can apparently tolerate
change in the gp120 residues contacting this cavity (H. Deng et al., Nature,
381:661-666 (1996)).

The recessed nature of the CD4 binding pocket on gp120 (Figure 1B)
can delay the generation of high-affinity antibodies against the CD4BS
epitopes and may afford opportunities to minimize the antiviral efficacy of
such antibodies once they are elicited. The degree of recession is probably
much greater on the full-length, glycosylated gp120 than is evident on the
crystallized gp120 core. The recessed pocket is flanked on one side by the
V1/V2 stem-loop structure. The characterization of HIV-1 escape mutants
from the IgG1b12 CD4BS antibody and the mapping of several V2
conformational epitopes support a model in which the V2 loop folds back
along the V1/V2 stem, with V2 residues 183-188 proximal to Asp 368 and
Glu 370. This model is consistent with observations that V1/V2 changes, in
combination with V3 changes, can alter the exposure of the adjacent CD4BS
epitopes, particularly on the assembled trimer (R. Wyatt et al., J. Virol.,
67:4557-4565 (1993)). The high temperature factors associated with the
V1/V2 stems imply flexibility in this protruding element, expanding the
potential range of space occupied by the V1/V2 stem-loop structure. This
could enhance masking of the adjacent CD4BS and CD4i gp120 epitopes
and divert antibody responses towards the variable loops.

Glycosylation can modify the interaction of antibodies with CD4BS
epitopes. The D loop, on the rim of the CD4-binding pocket opposite the
V1/V2 stem, contains a well-conserved glycosylation site, asparagine 276.
Changes in this site and at the adjacent alanine 281 have been associated
with escape from the neutralizing activity of patient sera (D. Ho et al., J.
Virol., 65:489-493 (1991)) and have been seen in SHIVs extensively
passaged in monkeys (M. Thali et al., J. Virol., 67:3978-3988 (1993)).
Another conserved glycosylation site at asparagine 386 lies adjacent to both
CD4BS and CD4i epitopes (Figure 1D) and could diminish antibody
responses against those sites. Additionally, in various HIV-1 strains,
carbohydrates are added to the V2 loop segment (residues 186-188) thought
to be proximal to the CD4BS epitopes.

c) The 2G12 epitope. The integrity of the 2G12 epitope is
disrupted by changes in gp120 glycosylation, either by glycosidase
treatment or mutagenic alteration of specific N-linked carbohydrate addition
sites (W. Robey et al., Proc. Natl. Acad. Sci. U.S.A., 83:7023-7027 (1986)).
These sites are located on the relatively variable surface of the gp120 outer
domain, opposite to and approximately 25 Å away from the CD4 binding site
(Figures 1E, 3B and 3C). The gp120 glycoprotein synthesized in mammalian
cells exhibits a dense concentration of high-mannose sugars in this region
(Figure 3A). Even in the enzymatically deglycosylated gp120 core,
carbohydrate residues constitute much of this surface. 2G12 likely binds at
least in part to these carbohydrates, explaining the surprising conservation
of the 2G12 epitope despite the variability of the underlying protein surface,
which includes the stem of the V3 loop and the V4 variable region. The
inclusion of carbohydrate in the epitope might also explain the apparent
rarity with which these antibodies are generated. The localization of the
2G12 epitope is consistent with previous studies indicating that 2G12 forms
a unique competition group (A. Pinter et al., J. Virol., 63:2674-2679 (1989);
and W. Robey et al., Proc. Natl. Acad. Sci. U.S.A., 83:7023-7027 (1986)) and
does not interfere with the binding of monomeric gp120 to either CD4 or
chemokine receptors (T. Dragic et al., Nature, 381:667-673 (1996)). Since
the 2G12 epitope is predicted to be oriented towards the target cell upon
CD4 binding (see below), the antibody may sterically impair interactions of
the oligomeric envelope glycoprotein complex with host cell moieties.

Possible orientations of the exterior glycoproteins in the trimer are
significantly constrained by the requirement that observed and deduced
binding sites for receptors and neutralizing antibodies, sites of N-linked
glycosylation, and variable structures be exposed on the surface of the
assembled complex. The two-domain CD4 in the ternary complex structure
was aligned to the structure of four-domain CD4 (29) to orient the trimer
model with respect to the target cell membrane. The consequences of such
a model, which is shown in Figure 4, are: a) the chemokine receptor-binding
sites are clustered at the vertex of the trimer predicted to be closest to the
target cell; b) both variable and conserved neutralization epitopes are
concentrated on the half of gp120 facing the target cell; c) possibilities for
intersubunit interactions among the variable structures that could help
mask conserved neutralization epitopes are created; d) the subset of gp120
glycosylation sites to which complex carbohydrates are added in mammalian
cells (L. Wu et al., Nature, 384:179-183 (1996)) is well-exposed on the outer
periphery of the trimer; e) the highly conserved surface near the pi helix is
available for gp41 and/or gp120 protein interactions within the trimers; and
f) the surface of the assembled envelope glycoprotein complex is roughly
hemispherical, thus minimizing the surface area of the viral spike that is
potentially exposed to antibodies.

In summary, the X-ray crystal structure of the gp120 core/two-domain
CD4/17b Fab complex provides a framework for visualizing key interactions
between HIV-1 and the humoral immune system. Previous antibody
competition analyses suggested that the gp120 surface buried in the
assembled trimer elicits non-neutralizing antibodies. By contrast, the
binding sites for neutralizing antibodies cluster on a different gp120 surface.
Our structural studies disclose the existence of non-neutralizing and
neutralizing faces of gp120, and reveal another, immunologically "silent"
face of the glycoprotein (Figure 3D). This outer domain surface, along with
the major variable loops, contributes to the large fraction of the gp120
surface that is protected against antibody responses by a dense array of
carbohydrates and by the capacity for variation. The conserved
receptor-binding regions of gp120 represent attractive targets for immune
intervention. However, the elicitation of antibodies against these
conformation-dependent structures has proven inefficient. Since the gp120
epitopes near the receptor-binding regions span the inner and outer
domains, interdomain conformational shifts may decrease their
representation in the immunogen pool. The recessed nature of the
CD4-binding site likely contributes to its poor immunogenicity. The sequential
recognition of two receptors by primate immunodeficiency viruses allows the
conserved elements of the chemokine receptor-binding site to be created or
exposed by the modified polypeptides described herein.

We Claim:

1. A modified gp120 polypeptide comprising portions of at least
two conserved regions of an envelope protein selected from a primate
lentivirus, wherein at least one of the following changes relative to the
wild-type gp120 protein is made:

(a) introduction of disulfide bonds;

(b) filling a cavity of the gp120 protein with hydrophobic amino
acid residues;

(c) introducing a Pro residue at a defined turn structure; or

(d) increasing the hydrophobicity across the interface between the
gp120 domains,

wherein the modified polypeptide maintains the overall 3-dimensional
structure of a discontinuous conserved epitope of the wild-type gp120.

2. The modified gp120 polypeptide of claim 1, wherein the
discontinuous conserved epitope is a CD4BS epitope or CD4i epitope.

3. The modified gp120 polypeptide of claim 2, wherein the gp120
protein is selected from the group consisting of HIV-1, HIV-2 and SIV.

4. The modified gp120 polypeptide of claim 3, wherein the gp120
protein is HIV-1.

5. The modified gp120 polypeptide of claim 4, wherein disulfide
bonds are introduced between at least one of the groups of amino acids that
correspond to Pro118-Ala443, Leu122-Gly431, Phe210-Gly30, or Ser256-
Phe376 of the HIV-1 HXBc2 strain.

6. The modified gp120 polypeptide of claims 4 or 5, wherein at
least one amino acid residue corresponding to wild-type gp120 Ser375,
Val255, Arg273, Ser481, Ser447, Asn377 of the HIV-1 HXBc2 strain,
Thr283, or Asp477 of the HIV-1 HXBc2 strain, has been substituted with a
hydrophobic amino acid residue.

7. The modified gp120 polypeptide of claim 6, wherein at least
one of the following amino acid substitutions is present:

Trp for Ser375, Val255 or Arg273;

Phe for Ser481;

Ile for Ser447 or Thr283;

or Leu for Asn377 or Thr283.

8. The modified gp120 polypeptide of claim 6, wherein a Pro
residue has been introduced at a defined turn structure.

9. The modified gp120 polypeptide of claim 5, wherein a Pro
residue has been introduced at a defined turn structure.

10. The modified gp120 polypeptide of claim 4, wherein a Pro
residue has been introduced at a defined turn structure.

11. The modified gp120 polypeptide of claim 8, wherein a Pro
residue has been substituted for Ile423.

12. The modified gp120 polypeptide of claim 9, wherein a Pro
residue has been substituted for Ile423.

13. The modified gp120 polypeptide of claim 10, wherein Pro has
been substituted for Ile423.

14. The modified gp120 polypeptide of claim 1, wherein at least
two of the changes have been made.

15. The modified gp120 polypeptide of claim 14, wherein the
discontinuous conserved epitope is selected from the group of epitopes
consisting of CD4i, CD4BS, and 2G12 epitopes.

16. The modified gp120 polypeptide of claim 15, wherein at least
three of the changes have been made.
Boston, MA 02109-4280
UNITED STATES OF AMERICA

PCT

NOTIFICATION OF TRANSMITTAL OF
THE INTERNATIONAL SEARCH REPORT

Date of mailing (day/month/year): 18/03/1999

Applicant's or agent's file reference: 48436-PCT

FOR FURTHER ACTION: See paragraphs 1 and 4 below

International application No.: PCT/US 98/24001

International filing date (day/month/year): 10/11/1998

Applicant: DANA-FARBER CANCER INSTITUTE et al.



1. The applicant is hereby notified that the International Search Report has been established and is transmitted herewith.
Filing of amendments and statement under Article 19:

The applicant is entitled, if he so wishes, to amend the claims of the International Application (see Rule 46):

When? The time limit for filing such amendments is normally 2 months from the date of transmittal of the
International Search Report; however, for more details, see the notes on the accompanying sheet.

Where? Directly to the International Bureau of WIPO
34, chemin des Colombettes
1211 Geneva 20, Switzerland
Facsimile No.: (41-22) 740.14.35

For more detailed instructions, see the notes on the accompanying sheet.

2. [ ] The applicant is hereby notified that no International Search Report will be established and that the declaration under
Article 17(2)(a) to that effect is transmitted herewith.



3. [ ] With regard to the protest against payment of (an) additional fee(s) under Rule 40.2, the applicant is notified that:

[ ] the protest together with the decision thereon has been transmitted to the International Bureau together with the
applicant's request to forward the texts of both the protest and the decision thereon to the designated Offices.

[ ] no decision has been made yet on the protest; the applicant will be notified as soon as a decision is made.

4. Further action(s): The applicant is reminded of the following:

Shortly after 18 months from the priority date, the international application will be published by the International Bureau.
If the applicant wishes to avoid or postpone publication, a notice of withdrawal of the international application, or of the
priority claim, must reach the International Bureau as provided in Rules 90bis.1 and 90bis.3, respectively, before the
completion of the technical preparations for international publication.

Within 19 months from the priority date, a demand for international preliminary examination must be filed if the applicant
wishes to postpone the entry into the national phase until [illegible] months from the priority date (in some Offices even later).

Within 20 months from the priority date, the applicant must perform the prescribed acts for entry into the national phase
before all designated Offices which have not been elected in the demand or in a later election within 19 months from the
priority date or could not be elected because they are not bound by Chapter II.
European Patent Office, P.B. 5818 Patentlaan 2
NL-2280 HV Rijswijk
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl,
Fax: (+31-70) 340-3016

Authorized officer: Heike Zoglauer

Form PCT/ISA/220 (July 1998)



FOR FURTHER ACTION: see Notification of Transmittal of International Search Report

International application No.: PCT/US 98/24001

International filing date (day/month/year): 10/11/1998

(Earliest) Priority Date (day/month/year): 10/11/1997

Applicant: DANA-FARBER CANCER INSTITUTE et al.



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant
according to Article 18. A copy is being transmitted to the International Bureau.

This International Search Report consists of a total of 3 sheets.

[X] It is also accompanied by a copy of each prior art document cited in this report.



1. Basis of the report

a. With regard to the language, the international search was carried out on the basis of the international application in the
language in which it was filed, unless otherwise indicated under this item.

[ ] the international search was carried out on the basis of a translation of the international application furnished to this
Authority (Rule 23.1(b)).

With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international search
was carried out on the basis of the sequence listing:

[ ] contained in the international application in written form.

[ ] filed together with the international application in computer readable form.

[ ] furnished subsequently to this Authority in written form.

[ ] furnished subsequently to this Authority in computer readable form.

[ ] the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the
international application as filed has been furnished.

[ ] the statement that the information recorded in computer readable form is identical to the written sequence listing has been
furnished.



2. [ ] Certain claims were found unsearchable (see Box I).

3. [ ] Unity of invention is lacking (see Box II).

4. With regard to the title,

[X] the text is approved as submitted by the applicant.

[ ] the text has been established by this Authority to read as follows:

With regard to the abstract,

[X] the text is approved as submitted by the applicant.

[ ] the text has been established, according to Rule 38.2(b), by this Authority as it appears in Box III. The applicant may,
within one month from the date of mailing of this international search report, submit comments to this Authority.

The figure of the drawings to be published with the abstract is Figure No.

[ ] as suggested by the applicant.

[ ] because the applicant failed to suggest a figure.

[ ] because this figure better characterizes the invention.

[X] None of the figures.
vol. 11, no. 1, 1995, pages 185-188,
XP002093620

see the whole document

ROVINSKI B ET AL: "EXPRESSION AND
CHARACTERIZATION OF GENETICALLY ENGINEERED
HUMAN IMMUNODEFICIENCY VIRUS-LIKE
PARTICLES CONTAINING MODIFIED ENVELOPE
GLYCOPROTEINS: IMPLICATIONS FOR
DEVELOPMENT OF A CROSS-PROTECTIVE AIDS
VACCINE"

JOURNAL OF VIROLOGY,
vol. 66, no. 7, 1 July 1992, pages
4003-4012, XP000560157

see the whole document